Argument Map in DNA Strand ssarv2
from the '/v/lib/6mers' file.
Translation shown at open reading frames. 
1 CTGGAAGGGCTAATTTGGTCCCAAAGAAGACAAGAGATCCTTGATCTGTGGATCTACCACAC
GACCTTCCCGATTAAACCAGGGTTTCTTCTGTTCTCTAGGAACTAGACACCTAGATGGTGTG 

26 mboII, 50 binI,

63 ACAAGGCTACTTCCCTGATTGGCAGAATTACACACCAGGGCCAGGGATCAGATATCCACT 
TGTTCCGATGAAGGGACTAACCGTCTTAATGTGTGGTCCCGGTCCCTAGTCTATAGGTGA 

107 binI, 113 ecor5,

123 GACCTTTGGATGGTGCTTCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCCAA 
CTGGAAACCTACCACGAAGTTCGATCATGGTCAACTCGGTCTCTTCCATCTTCTCCGGTT 

172 mboII,

183 TGAAGGAGAGAACAACAGCTTGTTACACCCTATGAGCCTGCATGGGATGGAGGACGCGGA 
ACTTCCTCTCTTGTTGTCGAACAATGTGGGATACTCGGACGTACCCTACCTCCTGCGCCT

243 GAAAGAAGTGTTAGTGTGGAGGTTTGACAGCAAACTAGCATTTCATCACATGGCCCGAGA
CTTTCTTCACAATCACACCTCCAAACTGTCGTTTGATCGTAAAGTAGTGTACCGGGCTCT 

296 avaI,

303 GCTGCATCCGGAGTACTACAAAGACTGCTGACATCGAGCTTTCTACAAGGGACTTTCCGC 
CGACGTAGGCCTCATGATGTTTCTGACGACTGTAGCTCGAAAGATGTTCCCTGAAAGGCG 

314 scaI,

363 TGGGGACTTTCCAGGGAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGTCCCTCAGATGC 
ACCCCTGAAAGGTCCCTCCGCACCGGACCCGCCCTGACCCCTCACCGCAGGGAGTCTACG 

423 TGCATATAAGCAGACTGCTTTTTGCCTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAG 
ACGTATATTCGTCTGACGAAAAACGGACATGACCCAGAGAGACCAATCTGGTCTAGACTC

474 bglII,

483 CCTGGGAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTT 
GGACCCTCGAGAGACCGATTGATCCCTTGGGTGACGAATTCGGAGTTATTTCGAACGGAA 

488 sacI, 518 aflII, 532 hindIII,

543 GAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTCA 
CTCACGAAGTTCATCACACACGGGCAGACAACACACTGAGACCATTGATCTCTAGGGAGT 

603 GACCCTTTTAGTCAGTGTGGAAAAATCTCTAGCAGTGGCGCCCGAACAGGGACGCGAAAG 
CTGGGAAAATCAGTCACACCTTTTTAGAGATCGTCACCGCGGGCTTGTCCCTGCGCTTTC 

639 narI,

663 CGAAAGTAGAACCAGAGGAGCTCTCTCGACGCAGGACTCGGCTTGCTGAAGCGCGCACAG
GCTTTCATCTTGGTCTCCTCGAGAGAGCTGCGTCCTGAGCCGAACGACTTCGCGCGTGTC

680 sacI,



723 CAAGAGGCGAGGGGCGGCGACTGGTGAGTACGCCAATTTTTGACTAGCGGAGGCTAGAAG 
GTTCTCCGCTCCCCGCCGCTGACCACTCATGCGGTTAAAAACTGATCGCCTCCGATCTTC 

783 GAGAGAGAGATGGGTGCGAGAGCGTCGGTATTAAGCGGGGGAGAATTAGATAAATGGGAA
CTCTCTCTCTACCCACGCTCTCGCAGCCATAATTCGCCCCCTCTTAATCTATTTACCCTT



GAG 
LysIleArgLeuArgProGlyGlyLysLysLysTyrLysLeuLysHisIleValTrpAla
843 AAAATTCGGTTAAGGCCAGGGGGAAAGAAAAAATATAAGTTAAAACATATAGTATGGGCA
TTTTAAGCCAATTCCGGTCCCCCTTTCTTTTTTATATTCAATTTTGTATATCATACCCGT

SerArgGluLeuGluArgPheAlaValAsnProGlyLeuLeuGluThrSerGluGlyCys 
903 AGCAGGGAGCTAGAACGATTCGCAGTCAATCCTGGCCTGTTAGAAACATCAGAAGGCTGC 
TCGTCCCTCGATCTTGCTAAGCGTCAGTTAGGACCGGACAATCTTTGTAGTCTTCCGACG 

959 pstI,

ArgGlnIleLeuGlyGlnLeuGlnProSerLeuGlnThrGlySerGluGluLeuArgSer
963 AGACAAATATTGGGACAGCTACAGCCATCCCTTCAGACAGGATCAGAAGAACTTAGATCA 
TCTGTTTATAACCCTGTCGATGTCGGTAGGGAAGTCTGTCCTAGTCTTCTTGAATCTAGT 

1002 binI, 1008 mboII,

LeuTyrAsnThrValAlaThrLeuTyrCysValHisGlnArgIleAspValLysAspThr
1023 TTATATAATACAGTAGCAACCCTCTATTGTGTACATCAAAGGATAGATGTAAAAGACACC 
AATATATTATGTCATCGTTGGGAGATAACACATGTAGTTTCCTATCTACATTTTCTGTGG 

LysGluAlaLeuGluLysIleGluGluGluGlnAsnLysSerLysLysLysAlaGlnGln
1083 AAGGAAGCTTTAGAGAAGATAGAGGAAGAGCAAAACAAAAGTAAGAAAAAGGCACAGCAA 
TTCCTTCGAAATCTCTTCTATCTCCTTCTCGTTTTGTTTTCATTCTTTTTCCGTGTCGTT 



1087 hindIII, 1097 mboII, 1107 mboII,



p25 



AlaAlaAlaAlaAlaGlyThrGlyAsnSerSerGlnValSerGlnAsnTyrProIleVal 
1143 GCAGCAGCTGCAGCTGGCACAGGAAACAGCAGCCAGGTCAGCCAAAATTACCCTATAGTG
CGTCGTCGACGTCGACCGTGTCCTTTGTCGTCGGTCCAGTCGGTTTTAATGGGATATCAC 


1147 pvuII, 1150 pstI, 1153 pvuII, 1156 tth111I,

GlnAsnLeuGlnGlyGlnMetValHisGlnAlaIleSerProArgThrLeuAsnAlaTrp
1203 CAGAACCTACAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGG
GTCTTGGATGTCCCCGTTTACCATGTAGTCCGGTATAGTGGATCTTGAAATTTACGTACC 

1250 ahaIII, 1255 ava3,

ValLysValValGluGluLysAlaPheSerProGluValIleProMetPheSerAlaLeu
1263 GTAAAAGTAGTAGAAGAAAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTA 
CATTTTCATCATCTTCTTTTCCGAAAGTCGGGTCTTCATTATGGGTACAAAAGTCGTAAT 

1275 mboII,

SerGluGlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlyHisGln
1323 TCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAA 
AGTCTTCCTCGGTGGGGTGTTCTAAATTTGTGGTACGATTTGTGTCACCCCCCTGTAGTT 

1346 ahaIII,

AlaAlaMetGlnMetLeuLysGluThrIleAsnGluGluAlaAlaGluTrpAspArgVal
1383 GCAGCCATGCAAATGTTAAAAGAGACTATCAATGAGGAAGCTGCAGAATGGGATAGAGTG 
CGTCGGTACGTTTACAATTTTCTCTGATAGTTACTCCTTCGACGTCTTACCCTATCTCAC 

1423 pstI,

HisProValHisAlaGlyProIleAlaProGlyGlnMetArgGluProArgGlySerAsp 
1443 CATCCAGTGCATGCAGGGCCTATTGCACCAGGCCAAATGAGAGAACCAAGGGGAAGTGAC 
GTAGGTGACGTACGTCCCGGATAACGTGGTCCGGTTTACTCTCTTGGTTCCCCTTCACTG 

1451 sphI,

IleAlaGlyThrThrSerThrLeuGlnGluGlnIleGlyTrpMetThrAsnAsnProPro
1503 ATAGCAGGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCT 
TATCGTCCTTGATGATCATGGGAAGTCCTTGTTTATCCTACCTACTGTTTATTAGGTGGA 

IleProValGlyGluIleTyrLysArgTrpIleIleLeuGlyLeuAsnLysIleValArg
1563 ATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGA 
TAGGGTCATCCTCTTTAGATATTTTCTACCTATTAGGACCCTAATTTATTTTATCATTCT 

MetTyrSerProThrSerIleLeuAspIleArgGlnGlyProLysGluProPheArgAsp
1623 ATGTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGAT 
TACATATCGGGATGGTCGTAAGACCTGTATTCTGTTCCTGGTTTCCTTGGGAAATCTCTA

1636 bstXI, 

TyrValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnAspValLysAsn 
1683 TATGTAGACCGGTTCTATAAAACTCTAAGAGCCGAACAAGCTTCACAGGATGTAAAAAAT 
ATACATCTGGCCAAGATATTTTGAGATTCTCGGCTTGTTCGAAGTGTCCTACATTTTTTA 

1720 hindIII,

TrpMetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrIleLeuLys
1743 TGGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATTGTAAGACTATTTTAAAA 
ACCTACTGTCTTTGGAACAACCAGGTTTTACGTTTGGGTCTAACATTCTGATAAAATTTT 

1796 ahaIII,

AlaLeuGlyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGly
1803 GCATTGGGACCAGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGA 
CGTAACCCTGGTCGTCGATGTGATCTTCTTTACTACTGTCGTACAGTCCCTCACCCCCCT 

1827 mboII,

ProGlyHisLysAlaArgValLeuAlaGluAlaMetSerGlnValThrAsnProAlaAsn
1863 CCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCCATGAGCCAAGTAACAAATCCAGCTAAC 
GGGCCGGTATTTCGTTCTCAAAACCGACTTCGGTACTCGGTTCATTGTTTAGGTCGATTG

£18 

IleMetMetGlnArgGlyAsnPheArgAsnGlnArgLysThrValLysCysPheAsnCys
1923 ATAATGATGCAGAGAGGCAATTTTAGGAACCAAAGAAAGACTGTTAAGTGTTTCAATTGT 
TATTACTACGTCTCTCCGTTAAAATCCTTGGTTTCTTTCTGACAATTCACAAAGTTAACA 



GlyLysGluGlyHisIleAlaLysAsnCysArgAlaProArgLysLysGlyCysTrpArg 
1983 GGCAAAGAAGGGCACATAGCCAAAAATTGCAGGGCCCCTAGGAAAAAGGGCTGTTGGAGA 
CCGTTTCTTCCCGTGTATCGGTTTTTAACGTCCCGGGGATCCTTTTTCCCGACAACCTCT 

2014 apaI, 2019 avr2,

CysGlyArgGluGlyHisGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGly 
2043 TGTGGAAGGGAAGGACACCAAATGAAAGATTGCACTGAGAGACAGGCTAATTTTTTAGGG
ACACCTTCCCTTCCTGTGGTTTACTTTCTAACGTGACTCTCTGTCCGATTAAAAAATCCC 

2102 mboII,

LysIleTrpProSerTyrLysGlyArgProGlyAsnPheLeuGlnSerArgProGluPro
2103 AAGATCTGGCCTTCCTACAAGGGAAGGCCAGGGAATTTTCTTCAGAGCAGACCAGAGCCA
TTCTAGACCGGAAGGATGTTCCCTTCCGGTCCCTTAAAAGAAGTCTCGTCTGGTCTCGGT

2104 bglII, 2141 mboII,

ThrAlaProProGluGluSerPheArgPheGlyGluGluLysThrThrProSerGlnLys
2163 ACAGCCCCACCAGAAGAGAGCTTCAGGTTTGGGGAGGAGAAAACAACTCCCTCTCAGAAG 
TGTCGGGGTGGTCTTCTCTCGAAGTCCAAACCCCTCCTCTTTTGTTGAGGGAGAGTCTTC 

2175 mboII,

GlnGluProIleAspLysGluLeuTyrProLeuThrSerLeuArgSerLeuPheGlyAsn 
2223 CAGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAAC 
GTCCTCGGCTATCTGTTCCTTGACATAGGAAATTGAAGGGAGTCTAGTGAGAAACCGTTG 

AspProSerSerGlnOC 
2283 GACCCCTCGTCACAATAAGGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGA 
CTGGGGAGCAGTGTTATTCCTATCCCCCCGTTGATTTCCTTCGAGATAATCTATGTCCT 



MetAsnLeuProGlyLysTrpLysProLysMetIle
2342 GCAGATGATACAGTATTAGAAGAAATGAATTTGCCAGGAAAATGGAAACCAAAAATGATA 
CGTCTACTATGTCATAATCTTCTTTACTTAAACGGTCCTTTTACCTTTGGTTTTTACTAT 

2360 mboII, 2375 bstXI,

GlyGlyIleGlyGlyPheIleLysValArgGlnTyrAspGlnIleProValGluIleCys
2402 GGGGGAATTGGAGGTTTTATCAAAGTAAGACAGTACGATCAGATACCTGTAGAAATCTGT 
CCCCCTTAACCTCCAAAATAGTTTCATTCTGTCATGCTAGTCTATGGACATCTTTAGACA 

GlyHisLysAlaIleGlyThrValLeuValGlyProThrProValAsnIleIleGlyArg
2462 GGACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGA 
CCTGTATTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATTAACCTTCT 

2517 mboII,

AsnLeuLeuThrGlnIleGlyCysThrLeuAsnPheProIleSerProIleGluThrVal
2522 AATCTGTTGACTCAGATTGGTTGTACTTTAAATTTCCCCATTAGTCCTATTGAAACTGTA 
TTAGACAACTGAGTCTAACCAACATGAAATTTAAAGGGGTAATCAGGATAACTTTGACAT 

2548 ahaIII, 2577 tth111I,

ProValLysLeuLysProGlyMetAspGlyProLysValLysGlnTrpProLeuThrGlu 
2582 CCAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAGCAATGGCCATTGACAGAA 
GGTCATTTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTCGTTACCGGTAACTGTCTT 

2627 balI, 2639 mboII,

GluLysIleLysAlaLeuValGluIleCysThrGluMetGluLysGluGlyLysIleSer 
2642 GAAAAAATAAAAGCATTAGTAGAGATATGTACAGAAATGGAAAAGGAAGGGAAAATTTCA 
CTTTTTTATTTTCGTAATCATCTCTATACATGTCTTTACCTTTTCCTTCCCTTTTAAAGT 

LysIleGlyProGluAsnProTyrAsnThrProValPheAlaIleLysLysLysAspSer
2702 AAAATTGGGCCTGAAAATCCATACAATACTCCAGTATTTGCTATAAAGAAAAAAGACAGT
TTTTAACCCGGACTTTTAGGTATGTTATGAGGTCATAAACGATATTTCTTTTTTCTGTCA

2759 scaI,

ThrLysTrpArgLysLeuValAspPheArgGluLeuAsnLysArgThrGlnAspPheTrp
2762 ACTAAATGGAGAAAACTAGTAGATTTCAGAGAACTTAATAAAAGAACTCAAGACTTCTGG
TGATTTACCTCTTTTGATCATCTAAAGTCTCTTGAATTATTTTCTTGAGTTCTGAAGACC 

GluValGlnLeuGlyIleProHisProGlnGlyOC
2822 GAAGTTCAGTTAGGAATACCACACCCGCAGGGTTAAAAAAAGAAAAAATCAGTAACAGTA
CTTCAAGTCAATCCTTATGGTGTGGGCGTCCCAATTTTTTTCTTTTTTAGTCATTGTCAT

2882 TTGGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATAAAGACTTTAGAAAGTATACTG
AACCTACACCCACTACGTATGAAAAGTCAAGGGAATCTATTTCTGAAATCTTTCATATGAC 

2895 ava3, 



MetArgHisGlnGlyLeuAspIleSerThrMetTrp
2943 CATTTACCATACCTAGTATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTGG
GTAAATGGTATGGATCATATTTGTTACTCTGTGGTCCCTAATCTATAGTCATGTTACACC 

2985 ecor5, 

LeuProGlnGlyTrpLysGlySerProAlaIlePheGlnSerSerMetThrLysIleLeu
3003 CTGCCACAGGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGACAAAAATCTTA 
GACGGTGTCCCTACCTTTCCTAGTGGTCGTTATAAGGTTTCATCGTACTGTTTTTAGAAT 

3003 tth111I, 3006 bstXI, 3021 binI,

GluProPheArgLysGlnAsnProAspIleValIleTyrGlnTyrMetAspAspLeuTyr
3063 GAGCCTTTTAGAAAACAGAATCCAGACATAGTTATCTATCAATACATGGATGATTTGTAT 
CTCGGAAAATCTTTTGTCTTAGGTCTGTATCAATAGATAGTTATGTACCTACTAAACATA 

ValGlySerAspLeuGluIleGlyGlnHisArgThrLysIleGluGluLeuArgGlnHis
3123 GTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAAAAATAGAGGAACTGAGACAGCAT 
CATCCTAGACTGAATCTTTATCCCGTCGTATCTTGTTTTTATCTCCTTGACTCTGTCGTA 

3126 binI, 3171 tth111I,

LeuLeuArgTrpGlyPheThrThrProAspLysLysHisGlnLysGluProProPheLeu 
3183 CTGTTGAGGTGGGGATTTACCACACCAGACAAAAAACATCAGAAAGAACCTCCATTCCTT 
GACAACTCCACCCCTAAATGGTGTGGTCTGTTTTTTGTAGTCTTTCTTGGAGGTAAGGAA

3234 bstXI,

TrpMetGlyTyrGluLeuHisProAspLysTrpThrValGlnProIleMetLeuProGlu
3243 TGGATGGGTTATGAACTCCATCCTGATAAATGGACAGTACAGCCTATAATGCTGCCAGAA
ACCTACCCAATACTTGAGGTAGGACTATTTACCTGTCATGTCGGATATTACGACGGTCTT 

LysAspSerTrpThrValAsnAspIleGlnLysLeuValGlyLysLeuAsnTrpAlaSer 
3303 AAAGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGAAAATTGAATTGGGCAAGT 
TTTCTGTCGACCTGACAGTTACTGTATGTCTTCAATCACCCTTTTAACTTAACCCGTTCA 

3308 pvuII,

GlnIleTyrAlaGlyIleLysValLysGlnLeuCysLysLeuLeuArgGlyThrLysAla
3363 CAGATTTATGCAGGGATTAAAGTAAAGCAGTTATGTAAACTCCTTAGAGGAACCAAAGCA
GTCTAAATACGTCCCTAATTTCATTTCGTCAATACATTTGAGGAATCTCCTTGGTTTCGT 

LeuThrGluValIleProLeuThrGluGluAlaGluLeuGluLeuAlaGluAsnArgGlu
3423 CTAACAGAAGTAATACCACTAACAGAAGAAGCAGAGCTAGAACTGGCAGAAAACAGGGAG
GATTGTCTTCATTATGGTGATTGTCTTCTTCGTCTCGATCTTGACCGTCTTTTGTCCCTC

3447 mboII,

IleLeuLysGluProValHisGluValTyrTyrAspProSerLysAspLeuValAlaGlu
3483 ATTCTAAAAGAACCAGTACATGAAGTATATTATGACCCATCAAAAGACTTAGTAGCAGAA
TAAGATTTTCTTGGTCATGTACTTCATATAATACTGGGTAGTTTTCTGAATCATCGTCTT 

IleGlnLysGlnGlyGlnGlyGlnTrpThrTyrGlnIleTyrGlnGluProPheLysAsn
3543 ATACAGAAGCAGGGGCAAGGCCAATGGACATATCAAATTTATCAAGAGCCATTTAAAAAT
TATGTCTTCGTCCCCGTTCCGGTTACCTGTATAGTTTAAATAGTTCTCGGTAAATTTTTA

3594 ahaIII,



POL 
LeuLysThrGlyLysTyrAlaArgMetArgGlyAlaHisThrAsnAspValLysGlnLeu
3603 CTGAAAACAGGAAAGTATGCAAGGATGAGGGGTGCCCACACTAATGATGTAAAACAGTTA 
GACTTTTGTCCTTTCATACGTTCCTACTCCCCACGGGTGTGATTACTACATTTTGTCAAT 

3659 hpaI,

ThrGluAlaValGlnLysValSerThrGluSerIleValIleTrpGlyLysIleProLys
3663 ACAGAGGCAGTGCAAAAAGTATCCACAGAAAGCATAGTAATATGGGGAAAGATTCCTAAA
TGTCTCCGTCACGTTTTTCATAGGTGTCTTTCGTATCATTATACCCCTTTCTAAGGATTT 

PheLysLeuProIleGlnLysGluThrTrpGluAlaTrpTrpMetGluTyrTrpGlnAla
3723 TTTAAACTACCCATACAAAAGGAAACATGGGAAGCATGGTGGATGGAGTATTGGCAAGCT 
AAATTTGATGGGTATGTTTTCCTTTGTACCCTTCGTACCACCTACCTCATAACCGTTCGA 

3723 ahaIII,

ThrTrpIleProGluTrpGluPheValAsnThrProProLeuValLysLeuTrpTyrGln
3783 ACCTGGATTCCTGAGTGGGAGTTTGTCAATACCCCTCCCTTAGTGAAATTATGGTACCAG 
TGGACCTAAGGACTCACCCTCAAACAGTTATGGGGAGGGAATCACTTTAATACCATGGTC 

3835 kpnI,

LeuGluLysGluProIleValGlyAlaGluThrPheTyrValAspGlyAlaAlaAsnArg 
3843 TTAGAGAAAGAACCCATAGTAGGAGCAGAAACTTTCTATGTAGATGGGGCAGCTAATAGG 
AATCTCTTTCTTGGGTATCATCCTCGTCTTTGAAAGATACATCTACCCCGTCGATTATCC 

GluThrLysLeuGlyLysAlaGlyTyrValThrAspArgGlyArgGlnLysValValSer
3903 GAGACTAAATTAGGAAAAGCAGGATATGTTACTGACAGAGGAAGACAAAAAGTTGTCTCC 
CTCTGATTTAATCCTTTTCGTCCTATACAATGACTGTCTCCTTCTGTTTTTCAACAGAGG 

3943 mboII,

IleAlaAspThrThrAsnGlnLysThrGluLeuGlnAlaIleHisLeuAlaLeuGlnAsp
3963 ATAGCTGACACAACAAATCAGAAGACTGAATTACAAGCAATTCATCTAGCTTTGCAGGAT
TATCGACTGTGTTGTTTAGTCTTCTGACTTAATGTTCGTTAAGTAGATCGAAACGTCCTA 

3983 mboII,

SerGlyLeuGluValAsnIleValThrAspSerGlnTyrAlaLeuGlyIleIleGlnAla
4023 TCGGGATTAGAAGTAAACATAGTAACAGACTCACAATATGCATTAGGAATCATTCAAGCA 
AGCCCTAATCTTCATTTGTATCATTGTCTGAGTGTTATACGTAATCCTTAGTAAGTTCGT

4060 ava3, 

GlnProAspLysSerGluSerGluLeuValSerGlnIleIleGluGlnLeuIleLysLys
4083 CAACCAGATAAGAGTGAATCAGAGTTAGTCAGTCAAATAATAGAGCAGTTAATAAAAAAG
GTTGGTCTATTCTCACTTAGTCTCAATCAGTCAGTTTATTATCTCGTCAATTATTTTTTC 

GluLysValTyrLeuAlaTrpValProAlaHisLysGlyIleGlyGlyAsnGluGlnVal
4143 GAAAAGGTCTACCTGGCATGGGTACCAGCACACAAAGGAATTGGAGGAAATGAACAAGTA 
CTTTTCCAGATGGACCGTACCCATGGTCGTGTGTTTCCTTAACCTCCTTTACTTGTTCAT 

4163 kpnI,

AspLysLeuValSerAlaGlyIleArgLysValLeuPheLeuAsnGlyIleAspLysAla
4203 GATAAATTAGTCAGTGCTGGAATCAGGAAAGTACTATTTTTGAATGGAATAGATAAGGCC 
CTATTTAATCAGTCACGACCTTAGTCCTTTCATGATAAAAACTTACCTTATCTATTCCGG 

4232 scaI,
GlnGluGluHisGluLysTyrHisSerAsnTrpArgAlaMetAlaSerAspPheAsnLeu
4263 CAAGAAGAACATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAGTGATTTTAACCTG
GTTCTTCTTGTACTCTTTATAGTGTCATTAACCTCTCGTTACCGATCACTAAAATTGGAC

4266 mboII,

ProProValValAlaLysGluIleValAlaSerCysAspLysCysGlnLeuLysGlyGlu
4323 CCACCTGTAGTAGCAAAAGAAATAGTAGCCAGCTGTGATAAATGTCAGCTAAAAGGAGAA 
GGTGGACATCATCGTTTTCTTTATCATCGGTCGACACTATTTACAGTCGATTTTCCTCTT 

4352 pvuII,

AlaMetHisGlyGlnValAspCysSerProGlyIleTrpGlnLeuAspCysThrHisLeu
4383 GCCATGCATGGACAAGTAGACTGTAGTCCAGGAATATGGCAACTAGATTGTACACATCTA
CGGTACGTACCTGTTCATCTGACATCAGGTCCTTATACCGTTGATCTAACATGTGTAGAT 

4386 ava3, 4410 bstXI, 4439 xbaI,

GluGlyLysIleIleLeuValAlaValHisValAlaSerGlyTyrIleGluAlaGluVal
4443 GAAGGAAAAATTATCCTGGTAGCAGTTCATGTAGCCAGTGGATATATAGAAGCAGAAGTT 
CTTCCTTTTTAATAGGACCATCGTCAAGTACATCGGTCACCTATATATCTTCGTCTTCAA 

4497 xmnI,

IleProAlaGluThrGlyGlnGluThrAlaTyrPheLeuLeuLysLeuAlaGlyArgTrp 
4503 ATTCCAGCAGAGACAGGGCAGGAAACAGCATATTTTCTCTTAAAATTAGCAGGAAGATGG 
TAAGGTCGTCTCTGTCCCGTCCTTTGTCGTATAAAAGAGAATTTTAATCGTCCTTCTACC 

4555 mboII, 4560 balI,

ProValLysThrIleHisThrAspAsnGlySerAsnPheThrSerThrThrValLysAla
4563 CCAGTAAAAACAATACATACAGACAATGGCAGCAATTTCACCAGTACTACGGTTAAGGCC 
GGTCATTTTTGTTATGTATGTCTGTTACCGTCGTTAAAGTGGTCATGATGCCAATTCCGG 

4605 scaI,

AlaCysTrpTrpAlaGlyIleLysGlnGluPheGlyIleProTyrAsnProGlnSerGln
4623 GCCTGTTGGTGGGCAGGGATCAAGCAGGAATTTGGCATTCCCTACAATCCCCAAAGTCAA 
CGGACAACCACCCGTCCCTAGTTCGTCCTTAAACCGTAAGGGATGTTAGGGGTTTCAGTT 

4639 binI,

GlyValValGluSerMetAsnAsnGluLeuLysLysIleIleGlyGlnValArgAspGln
4683 GGAGTAGTAGAATCTATGAATAATGAATTAAAGAAAATTATAGGACAGGTAAGAGATCAG 
CCTCATCATCTTAGATACTTATTACTTAATTTCTTTTAATATCCTGTCCATTCTCTAGTC 

AlaGluHisLeuLysThrAlaValGlnMetAlaValPheIleHisAsnPheLysArgLys
4743 GCTGAACACCTTAAGACAGCAGTACAAATGGCAGTATTCATCCACAATTTTAAAAGAAAA 
CGACTTGTGGAATTCTGTCGTCATGTTTACCGTCATAAGTAGGTGTTAAAATTTTCTTTT 

4752 aflII, 4791 ahaIII,

GlyGlyIleGlyGlyTyrSerAlaGlyGluArgIleValAspIleIleAlaThrAspIle
4803 GGGGGGATTGGGGGATACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGACATA
CCCCCCTAACCCCCTATGTCACGTCCCCTTTCTTATCATCTGTATTATCGTTGTCTGTAT 

GlnThrLysGluLeuGlnLysGlnIleThrLysIleGlnAsnPheArgValTyrTyrArg
4863 CAAACTAAAGAACTACAAAAGCAAATTACAAAAATTCAAAATTTTCGGGTTTATTACAGG 
GTTTGATTTCTTGATGTTTTCGTTTAATGTTTTTAAGTTTTAAAAGCCCAAATAATGTCC 



FIG. 4I

AspAsnLysAspProLeuTrpLysGlyProAlaLysLeuLeuTrpLysGlyGluGlyAla
4923 GACAACAAAGATCCCCTTTGGAAAGGACCAGCAAAGCTTCTCTGGAAAGGTGAAGGGGCA
CTGTTGTTTCTAGGGGAAACCTTTCCTGGTCGTTTCGAAGAGACCTTTCCACTTCCCCGT

4956 hindIII,

ValValIleGlnAspAsnSerAspIleLysValValProArgArgLysAlaLysIleIle
4983 GTAGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAGAAAAGCAAAAATCATT 
CATCATTATGTTCTATTATCACTGTATTTTCATCACGGTTCTTCTTTTCGTTTTTAGTAA 

5023 mboII,

MetGluAsnArgTrpGlnValMetIleValTrpGlnValAspArgMetArgIle
ArgAspTyrGlyLysGlnMetAlaGlyAspAspCysValAlaSerArgGlnAspGluAsp 
5043 AGGGATTATGGAAAACAGATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGAT 
TCCCTAATACCTTTTGTCTACCGTCCACTACTAACACACCGTTCATCTGTCCTACTCCTA 

ArgThrTrpLysSerLeuValLysHisHisMetTyrIleSerLysLysAlaLysGlyTrp
AM 

5103 TAGAACATGGAAAAGTTTAGTAAAACACCATATGTATATTTCAAAGAAAGCTAAAGGATGG 
ATCTTGTACCTTTTCAAATCATTTTGTGGTATACATATAAAGTTTCTTTCGATTTCCTACC 



5131 ndeI,

PheTyrArgHisHisTyrGluSerThrHisProArgValSerSerGluValHisIle 
5163 TTTTATAGACATCACTATGAAAGTACTCATCCAAGAGTAAGTTCAGAAGTACACATC 
AAAATATCTGTAGTGATACTTTCATGAGTAGGTTCTCATTCAAGTCTTCATGTGTAG 

5185 scaI,

ProLeuGlyAspAlaLysLeuValIleThrThrTyrTrpGlyLeuHisThrGlyGluArg
5221 CCCCTAGGGGATGCTAAATTGGTAATAACAACATATTGGGGTCTGCATACAGGAGAAAGA 
GGGGATCCCCTACGATTTAACCATTATTGTTGTATAACCCCAGACGTATGTCCTCTTTCT

5223 avr2,

GluTrpHisLeuGlyGlnGlyValAlaIleGluTrpArgLysLysLysTyrSerThrGln
5281 GAATGGCATTTGGGCCAGGGAGTCGCCATAGAATGGAGGAAAAAGAAATATAGCACACAA 
CTTACCGTAAACCCGGTCCCTCAGCGGTATCTTACCTCCTTTTTCTTTATATCGTGTGTT 

ValAspProGlyLeuAlaAspGlnLeuIleHisLeuHisTyrPheAspCysPheSerGlu
5341 GTAGACCCTGGCCTAGCAGACCAACTAATTCATCTGCATTATTTTGATTGTTTTTCAGAA 
CATCTGGGACCGGATCGTCTGGTTGATTAAGTAGACGTAATAAAACTAACAAAAAGTCTT 

SerAlaIleLysAsnAlaIleLeuGlyTyrArgValSerProArgCysGluTyrGlnAla
5401 TCTGCTATAAAAAATGCCATATTAGGATATAGAGTTAGTCCTAGGTGTGAATATCAAGCA 
AGACGATATTTTTTACGGTATAATCCTATATCTCAATCAGGATCCACACTTATAGTTCGT 

5440 avr2, 

GlyHisAsnLysValGlySerLeuGlnTyrLeuAlaLeuAlaAlaLeuIleThrProLys
5461 GGACATAACAAGGTAGGATCTCTACAATACTTGGCACTAGCAGCATTAATAACACCAAAA 
CCTGTATTGTTCCATCCTAGAGATGTTATGAACCGTGATCGTCGTAATTATTGTGGTTTT 

5476 binI,

LysThrLysProProLeuProSerValLysLysLeuThrGluAspArgTrpAsnLysPro
5521 AAGACAAAGCCACCTTTGCCTAGTGTTAAGAAACTGACAGAGGATAGATGGAACAAGCCC 
TTCTGTTTCGGTGGAAACGGATCACAATTCTTTGACTGTCTCCTATCTACCTTGTTCGGG 



FIG. 4J 
GlnLysThrLysGlyHisArgGlySerHisThrMetAsnGlyHisAM
5581 CAGAAGACCAAGGGCCACAGAGGGAGCCATACAATGAATGGACACTAGAGCTTTTAGAGG
GTCTTCTGGTTCCCGGTGTCTCCCTCGGTATGTTACTTACCTGTGATCTCGAAAATCTCC

5583 mboII,

5641 AGCTTAAGAGAGAAGCTGTTAGACATTTTCCTAGGCCATGGCTCCATAGCTTAGGACAAT
TCGAATTCTCTCTTCGACAATCTGTAAAAGGATCCGGTACCGAGGTATCGAATCCTGTTA 

5643 aflII, 5670 avr2, 5676 ncoI,

5701 ATATCTATGAAACTTATGGGGATACTTGGGCAGGAGTGGAAGCCATAATAAGAATTCTGC 
TATAGATACTTTGAATACCCCTATGAACCCGTCCTCACCTTCGGTATTATTCTTAAGACG 

5752 ecorI,

5761 AACAACTGCTGTTTATTCATTTCAGAATTGGGTGTCAACATAGCAGAATAGGCATTATTC
TTGTTGACGACAAATAAGTAAAGTCTTAACCCACAGTTGTATCGTCTTATCCGTAATAAG

5821 AACAGAGGAGAGCAAGAAGAAATGGAGCCAGTAGATCCTAATCTAGAGCCCTGGAAGCAT 
TTGTCTCCTCTCGTTCTTCTTTACCTCGGTCATCTAGGATTAGATCTCGGGACCTTCGTA 

5836 mboII, 5862 xbaI,

5881 CCAGGAAGTCAGCCTAGGACTGCTTGTAACAATTGCTATTGTAAAAAGTGTTGCTTTCAT 
GGTCCTTCAGTCGGATCCTGACGAACATTGTTAACGATAACATTTTTCACAACGAAAGTA 

5893 avr2, 

5941 TGCTACGCGTGTTTCACAAGAAAAGGCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGA 
ACGATGCGCACAAAGTGTTCTTTTCCGAATCCGTAGAGGATACCGTCCTTCTTCGCCTCT 

5945 mluI, 5988 mboII,

6001 CAGCGACGAAGAGCTCCTCAGGACAGTCAGACTCATCAAGCTTCTCTATCAAAGCAGTAA
GTCGCTGCTTCTCGAGGAGTCCTGTCAGTCTGAGTAGTTCGAAGAGATAGTTTCGTCATT 

6008 mboII, 6011 sacI, 6016 mstII, 6038 hindIII,

6061 GTAGTAAATGTAATGCAATCTTTACAAATATTAGCAATAGTATCATTAGTAGTAGTAGCA 
CATCATTTACATTACGTTAGAAATGTTTATAATCGTTATCATAGTAATCATCATCATCGT

6121 ATAATAGCAATAGTTGTGTGGACCATAGTACTCATAGAATATAGGAAAATATTAAGACAA 
TATTATCGTTATCAACACACCTGGTATCATGAGTATCTTATATCCTTTTATAATTCTGTT 

6147 scaI,



MetLys 

6181 AGAAAATAGACAGATTAATTGATAGAATAAGAGAAAAAGCAGAAGACAGTGGCAATGAAA 
TCTTTTATCTGTCTAATTAACTATCTTATTCTCTTTTTCGTCTTCTGTCACCGTTACTTT 

6222 mboII,

ValLysGlyThrArgArgAsnTyrGlnHisLeuTrpArgTrpGlyThrLeuLeuLeuGly
6241 GTGAAGGGGACCAGGAGGAATTATCAGCACTTGTGGAGATGGGGCACCTTGCTCCTTGGG 
CACTTCCCCTGGTCCTCCTTAATAGTCGTGAACACCTCTACCCCGTGGAACGAGGAACCC 

MetLeuMetIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValPro
6301 ATGTTGATGATCTGTAGTGCTACAGAAAAATTGTGGGTCACAGTTTATTATGGAGTACCT 
TACAACTACTAGACATCACGATGTCTTTTTAACACCCAGTGTCAAATAATACCTCATGGA 



ENV 
ValTrpLysGluAlaThrThrThrLeuPheCysAlaSerAspAlaArgAlaTyrAspThr
6361 GTGTGGAAAGAAGCAACTACCACTCTATTTTGTGCATCAGATGCTAGAGCATATGATACA
CACACCTTTCTTCGTTGATGGTGAGATAAAACACGTAGTCTACGATCTCGTATACTATGT 

6410 ndeI,

GluValHisAsnValTrpAlaThrHisAlaCysValProThrAspProAsnProGlnGlu 
6421 GAGGTACATAATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAA 
CTCCATGTATTACAAACCCGGTGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTT

ValValLeuGlyAsnValThrGluAsnPheAsnMetTrpLysAsnAsnMetValGluGln
6481 GTAGTATTGGGAAATGTGACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAG
CATCATAACCCTTTACACTGTCTTTTAAAATTGTACACCTTTTTATTGTACCATCTTGTC 

MetGlnGluAspIleIleSerLeuTrpAspGlnSerLeuLysProCysValLysLeuThr
6541 ATGCAGGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACC 
TACGTCCTCCTATATTAGTCAAATACCCTAGTTTCGGATTTCGGTACACATTTTAATTGG 

6567 binI,

ProLeuCysValThrLeuAsnCysThrAspLeuGlyLysAlaThrAsnThrAsnSerSer 
6601 CCACTCTGTGTTACTTTAAATTGCACTGATTTGGGGAAGGCTACTAATACCAATAGTAGT 
GGTGAGACACAATGAAATTTAACGTGACTAAACCCCTTCCGATGATTATGGTTATCATCA 

6615 ahaIII,

AsnTrpLysGluGluIleLysGlyGluIleLysAsnCysSerPheAsnIleThrThrSer
6661 AATTGGAAAGAAGAAATAAAAGGAGAAATAAAAAACTGCTCTTTCAATATCACCACAAGC 
TTAACCTTTCTTCTTTATTTTCCTCTTTATTTTTTGACGAGAAAGTTATAGTGGTGTTCG 

6670 mboII,

IleArgAspLysIleGlnLysGluAsnAlaLeuPheArgAsnLeuAspValValProIle
6721 ATAAGAGATAAGATTCAGAAAGAAAATGCACTTTTTCGTAACCTTGATGTAGTACCAATA
TATTCTCTATTCTAAGTCTTTCTTTTACGTGAAAAAGCATTGGAACTACATCATGGTTAT 

AspAsnAlaSerThrThrThrAsnTyrThrAsnTyrArgLeuIleHisCysAsnArgSer 
6781 GATAATGCTAGTACTACTACCAACTATACCAACTATAGGTTGATACATTGTAACAGATCA 
CTATTACGATCATGATGATGGTTGATATGGTTGATATCCAACTATGTAACATTGTCTAGT 

6790 scaI,

ValIleThrGlnAlaCysProLysValSerPheGluProIleProIleHisTyrCysThr
6841 GTCATTACACAGGCCTGTCCAAAGGTATCATTTGAGCCAATTCCCATACATTATTGTACC 
CAGTAATGTGTCCGGACAGGTTTCCATAGTAAACTCGGTTAAGGGTATGTAATAACATGG

6851 stuI,

ProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThrPheAsnGlyLysGlyProCys
6901 CCGGCTGGTTTTGCGATTCTAAAGTGTAATAATAAAACGTTCAATGGAAAAGGACCATGT
GGCCGACCAAAACGCTAAGATTTCACATTATTATTTTGCAAGTTACCTTTTCCTGGTACA 

ThrAsnValSerThrValGlnCysThrHisGlyIleArgProIleValSerThrGlnLeu
6961 ACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAATAGTGTCAACTCAACTG 
TGTTTACAGTCGTGTCATGTTACATGTGTACCTTAATCCGGTTATCACAGTTGAGTTGAC

LeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerAspAsnPheThrAsn
7021 CTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGACAATTTCACGAAC 
GACAATTTACCGTCAGATCGTCTTCTTCTCCATCATTAATCTAGACTGTTAAAGTGCTTG 

7042 mboII, 7045 mboII, 7060 bglII,

FIG. 4L 






AsnAlaLysThrIleIleValGlnLeuAsnGluSerValAlaIleAsnCysThrArgPro
7081 AATGCTAAAACCATAATAGTACAGCTGAATGAATCTGTAGCAATTAACTGTACAAGACCC
TTACGATTTTGGTATTATCATGTCGACTTACTTAGACATCGTTAATTGACATGTTCTGGG

7102 pvuII,

AsnAsnAsnThrArgLysSerIleTyrIleGlyProGlyArgAlaPheHisThrThrGly
7141 AACAACAATACAAGAAAAAGTATCTATATAGGACCAGGGAGAGCATTTCATACAACAGGA
TTGTTGTTATGTTCTTTTTCATAGATATATCCTGGTCCCTCTCGTAAAGTATGTTGTCCT 

7199 mboII,

ArgIleIleGlyAspIleArgLysAlaHisCysAsnIleSerArgAlaGlnTrpAsnAsn
7201 AGAATAATAGGAGATATAAGAAAAGCACATTGTAACATTAGTAGAGCACAATGGAATAAC 
TCTTATTATCCTCTATATTCTTTTCGTGTAACATTGTAATCATCTCGTGTTACCTTATTG 

ThrLeuGluGlnIleValLysLysLeuArgGluGlnPheGlyAsnAsnLysThrIleVal
7261 ACTTTAGAACAGATAGTTAAAAAATTAAGAGAACAGTTTGGGAATAATAAAACAATAGTC 
TGAAATCTTGTCTATCAATTTTTTAATTCTCTTGTCAAACCCTTATTATTTTGTTATCAG 

PheAsnGlnSerSerGlyGlyAspProGluIleValMetHisSerPheAsnCysArgGly 
7321 TTTAATCAATCCTCAGGAGGGGACCCAGAAATTGTAATGCACAGTTTTAATTGTAGAGGG 
AAATTAGTTAGGAGTCCTCCCCTGGGTCTTTAACATTACGTGTCAAAATTAACATCTCCC 

7331 mstII,

GluPhePheTyrCysAsnThrThrGlnLeuPheAsnAsnThrTrpArgLeuAsnHisThr 
7381 GAATTTTTCTACTGTAATACAACACAACTGTTTAATAATACATGGAGGTTAAATCACACT 
CTTAAAAAGATGACATTATGTTGTGTTGACAAATTATTATGTACCTCCAATTTAGTGTGA 

GluGlyThrLysGlyAsnAspThrIleIleLeuProCysArgIleLysGlnIleIleAsn
7441 GAAGGAACTAAAGGAAATGACACAATCATACTCCCATGTAGAATAAAACAAATTATAAAC
CTTCCTTGATTTCCTTTACTGTGTTAGTATGAGGGTACATCTTATTTTGTTTAATATTTG 

MetTrpGlnGluValGlyLysAlaMetTyrAlaProProIleGlyGlyGlnIleSerCys
7501 ATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATTGGAGGACAAATTAGTTGT 
TACACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAACCTCCTGTTTAATCAACA 

SerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyThrAsnValThrAsnAsp
7561 TCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTACAAATGTAACTAATGAC
AGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATGTTTACATTGATTACTG 

ThrGluValPheArgProGlyGlyGlyAspMetArgAspAsnTrpArgSerGluLeuTyr
7621 ACCGAGGTCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATAT 
TGGCTCCAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACCTCTTCACTTAATATA 

7628 mboII,

LysTyrLysValIleLysIleGluProLeuGlyIleAlaProThrLysAlaLysArgArg
7681 AAATATAAAGTAATAAAAATTGAACCATTAGGAATAGCACCCACCAAGGCAAAGAGAAGA 
TTTATATTTCATTATTTTTAACTTGGTAATCCTTATCGTGGGTGGTTCCGTTTCTCTTCT 

7736 mboII,

ValValGlnArgGluLysArgAlaValGlyIleValGlyAlaMetPheLeuGlyPheLeu
7741 GTGGTGCAGAGAGAAAAAAGAGCAGTGGGAATAGTAGGAGCTATGTTCCTTGGGTTCTTG 
CACCACGTCTCTCTTTTTTCTCGTCACCCTTATCATCCTCGATACAAGGAACCCAAGAAC 

GlyAlaAlaGlySerThrMetGlyAlaValSerLeuThrLeuThrValGlnAlaArgGln
7801 GGAGCAGCAGGAAGCACTATGGGCGCAGTGTCATTGACGCTGACGGTACAGGCCAGACAA 
CCTCGTCGTCCTTCGTGATACCCGCGTCACAGTAACTGCGACTGCCATGTCCGGTCTGTT 



FIG. 4M 
LeuLeuSerGlyIleValGlnGlnGlnAsnAsnLeuLeuArgAlaIleGluAlaGlnGln
7861 TTATTGTCTGGTATAGTGCAACAGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAA 
AATAACAGACCATATCACGTTGTCGTCTTGTTAAACGACTCCCGATAACTCCGCGTTGTT 

HisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGlnAlaArgValLeuAlaVal
7921 CATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAGTCCTGGCTGTG 
GTAGACAACGTTGAGTGTCAGACCCCGTAGTTCGTCGAGGTCCGTTCTCAGGACCGACAC

GluArgTyrLeuArgAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGlyLysLeuIle
7981 GAAAGATACCTAAGGGATCAACAGCTCCTAGGGATTTGGGGTTGCTCTGGAAAACTCATT 
CTTTCTATGGATTCCCTAGTTGTCGAGGATCCCTAAACCCCAACGAGACCTTTTGAGTAA 

7989 mstII, 7995 binI, 8007 avr2,

CysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluAspIleTrp 
8041 TGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAAGACATTTGG 
ACGTGGTGACGACACGGAACCTTACGATCAACCTCATTATTTAGAGACCTTCTGTAAACC 

8089 mboII,

AspAsnMetThrTrpMetGlnTrpGluArgGluIleAspAsnTyrThrAsnThrIleTyr
8101 GATAACATGACCTGGATGCAGTGGGAAAGAGAAATTGACAATTACACAAACACAATATAC 
CTATTGTACTGGACCTACGTCACCCTTTCTCTTTAACTGTTAATGTGTTTGTGTTATATG 

ThrLeuLeuGluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeu
8161 ACCTTACTTGAAGAATCGCAGAACCAACAAGAAAAGAATGAACAAGAATTATTAGAATTG 
TGGAATGAACTTCTTAGCGTCTTGGTTGTTCTTTTCTTACTTGTTCTTAATAATCTTAAC 

8170 mboII,

AspLysTrpAlaSerLeuTrpAsnTrpPheSerIleThrAsnTrpLeuTrpTyrIleLys
8221 GATAAGTGGGCAAGTTTGTGGAATTGGTTTAGCATAACAAACTGGCTGTGGTATATAAAG 
CTATTCACCCGTTCAAACACCTTAACCAAATCGTATTGTTTGACCGACACCATATATTTC 

IlePheIleMetIleValGlyGlyLeuValGlyLeuArgIleValPheAlaValLeuSer
8281 ATATTCATAATGATAGTAGGAGGCTTGGTAGGTTTAAGAATAGTTTTTGCTGTGCTTTCT 
TATAAGTATTACTATCATCCTCCGAACCATCCAAATTCTTATCAAAAACGACACGAAAGA 

IleValAsnArgValArgGlnGlyTyrSerProLeuSerPheGlnThrArgLeuProVal
8341 ATAGTGAATAGAGTTAGGCAGGGATACTCACCATTGTCATTTCAGACCCGCCTCCCAGTC 
TATCACTTATCTCAATCCGTCCCTATGAGTGGTAACAGTAAAGTCTGGGCGGAGGGTCAG 

8400 avaI,

ProArgGlyProAspArgProAspGlyIleGluGluGluGlyGlyGluArgAspArgAsp
8401 CCGAGGGGACCCGACAGGCCCGACGGAATCGAAGAAGAAGGTGGAGAGAGAGACAGAGAC
GGCTCCCCTGGGCTGTCCGGGCTGCCTTAGCTTCTTCTTCCACCTCTCTCTCTGTCTCTG 

8431 mboII, 8434 mboII,

ArgSerValArgLeuValAspGlyPheLeuAlaLeuIleTrpGluAspLeuArgSerLeu
8461 AGATCCGTTCGATTAGTGGATGGATTCTTAGCACTTATCTGGGAAGATCTGCGGAGCCTG 
TCTAGGCAAGCTAATCACCTACCTAAGAATCGTGAATAGACCCTTCTAGACGCCTCGGAC 

8503 mboII, 8505 bglII,

CysLeuPheSerTyrArgArgLeuArgAspLeuLeuLeuIleAlaAlaArgThrValGlu
8521 TGCCTCTTCAGCTACCGCCGCTTGAGAGACTTACTCTTGATTGCAGCGAGGACTGTGGAA 
ACGGAGAAGTCGATGGCGGCGAACTCTCTGAATGAGAACTAACGTCGCTCCTGACACCTT 

8525 mboII,

FIG. 4N

IleLeuGlyHisArgGlyTrpGluAlaLeuLysTyrTrpTrpSerLeuLeuGlnTyrTrp
8581 ATTCTGGGGCACAGGGGGTGGGAAGCCCTCAAATATTGGTGGAGTCTCCTGCAGTATTGG
TAAGACCCCGTGTCCCCCACCCTTCGGGAGTTTATAACCACCTCAGAGGACGTCATAACC 

8629 pstI,

IleGlnGluLeuLysAsnSerAlaValSerTrpLeuAsnAlaThrAlaIleAlaValThr
8641 ATTCAGGAACTAAAGAATAGTGCTGTTAGCTGGCTCAACGCCACAGCTATAGCAGTAACT
TAAGTCCTTGATTTCTTATCACGACAATCGACCGAGTTGCGGTGTCGATATCGTCATTGA 

GluGlyThrAspArgValIleGluValAlaGlnArgAlaTyrArgAlaIleLeuHisIle
8701 GAGGGGACAGATAGGGTTATAGAAGTAGCACAAAGAGCTTATAGAGCTATTCTCCACATA 
CTCCCCTGTCTATCCCAATATCTTCATCGTGTTTCTCGAATATCTCGATAAGAGGTGTAT 



HisArgArgIleArgGlnGlyLeuGluArgLeuLeuLeuOC MetGlyGlyLysTrpSer
8761 CATAGAAGAATTAGACAGGGCTTGGAAAGGCTTTTGCTATAAGATGGGTGGCAAGTGGTCA
GTATCTTCTTAATCTGTCCCGAACCTTTCCGAAAACGATATTCTACCCACCGTTCACCAGT



8765 mboII,

LysArgSerMetGlyGlyTrpSerAlalleArgGluArgMetArgArgAlaGluProArg 
8822 AAACGTAGTATGGGTGGATGGTCTGCTATAAGGGAAAGAATGAGACGAGCTGAGCCACGA
TTTGCATCATACCCACCTACCAGACGATATTCCCTTTCTTACTCTGCTCGACTCGGTGCT 

AlaGluProAlaAlaAspGlyValGlyAlaValSerArgAspLeuGluLysHisGlyAla
8882 GCTGAGCCAGCAGCAGATGGGGTGGGAGCAGTATCTCGAGACCTGGAAAAACATGGAGCA 
CGACTCGGTCGTCGTCTACCCCACCCTCGTCATAGAGCTCTGGACCTTTTTGTACCTCGT 

8883 tth111I, 8916 avaI xhoI,

IleThrSerSerAsnThrAlaAlaThrAsnAlaAspCysAlaTrpLeuGluAlaGlnGlu 
8942 ATCACAAGTAGCAATACAGCAGCTACTAATGCTGATTGTGCCTGGCTAGAAGCACAAGAG 
TAGTGTTCATCGTTATGTCGTCGATGATTACGACTAACACGGACCGATCTTCGTGTTCTC 

GluGluGluValGlyPheProValArgProGlnValProLeuArgProMetThrTyrLys
9002 GAGGAAGAGGTGGGTTTTCCAGTCAGACCTCAGGTACCTTTAAGACCAATGACTTACAAG 
CTCCTTCTCCACCCAAAAGGTCAGTCTGGAGTCCATGGAAATTCTGGTTACTGAATGTTC 

9005 mboII, 9029 mstII, 9034 kpnI,

AlaAlaLeuAspIleSerHisPheLeuLysGluLysGlyGlyLeuGluGlyLeuIleTrp
9062 GCAGCTTTAGATATTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTTGG
CGTCGAAATCTATAATCGGTGAAAAATTTTCTTTTCCCCCCTGACCTTCCCGATTAAACC 

9085 ahaIII,

SerGlnArgArgGlnGluIleLeuAspLeuTrpIleTyrHisThrGlnGlyTyrPhePro
9122 TCCCAAAGAAGACAAGAGATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCT 
AGGGTTTCTTCTGTTCTCTAGGAACTAGACACCTAGATGGTGTGTGTTCCGATGAAGGGA 

9129 mboII, 9153 binI,

AspTrpGlnAsnTyrThrProGlyProGlyIleArgTyrProLeuThrPheGlyTrpCys
9182 GATTGGCAGAATTACACACCAGGGCCAGGGATCAGATATCCACTGACCTTTGGATGGTGC
CTAACCGTCTTAATGTGTGGTCCCGGTCCCTAGTCTATAGGTGACTGGAAACCTACCACG 

9210 binI, 9216 ecor5,



FIG. 4O

PheLysLeuValProValGluProGluLysValGluGluAlaAsnGluGlyGluAsnAsn
9242 TTCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCCAATGAAGGAGAGAACAAC 
AAGTTCGATCATGGTCAACTCGGTCTCTTCCATCTTCTCCGGTTACTTCCTCTCTTGTTG 

9275 mboII,

SerLeuLeuHisProMetSerLeuHisGlyMetGluAspAlaGluLysGluValLeuVal
9302 AGCTTGTTACACCCTATGAGCCTGCATGGGATGGAGGACGCGGAGAAAGAAGTGTTAGTG 
TCGAACAATGTGGGATACTCGGACGTACCCTACCTCCTGCGCCTCTTTCTTCACAATCAC 

TrpArgPheAspSerLysLeuAlaPheHisHisMetAlaArgGluLeuHisProGluTyr
9362 TGGAGGTTTGACAGCAAACTAGCATTTCATCACATGGCCCGAGAGCTGCATCCGGAGTAC 
ACCTCCAAACTGTCGTTTGATCGTAAAGTAGTGTACCGGGCTCTCGACGTAGGCCTCATG 

9399 avaI, 9417 scaI,

TyrLysAspCysOP 

9422 TACAAAGACTGCTGACATCGAGCTTTCTACAAGGGACTTTCCGCTGGGGACTTTCCAGGG 
ATGTTTCTGACGACTGTAGCTCGAAAGATGTTCCCTGAAAGGCGACCCCTGAAAGGTCCC 



9482 AGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGTCCCTCAGATGCTGCATATAAGCAGCTG 
TCCGCACCGGACCCGCCCTGACCCCTCACCGCAGGGAGTCTACGACGTATATTCGTCGAC 

9536 pvuII,

9542 CTTTTTGCCTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGC
GAAAAACGGACATGACCCAGAGAGACCAATCTGGTCTAGACTCGGACCCTCGAGAGACCG 

9576 bglII, 9590 sacI,

9602 TAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCTTCAAGTAGTG
ATTGATCCCTTGGGTGACGAATTCGGAGTTATTTCGAACGGAACTCACGAAGTTCATCAC 

9620 aflII, 9634 hindIII,

9662 TGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTCAGACCCTTTTAGTCAGTG 
ACACGGGCAGACAACACACTGAGACCATTGATCTCTAGGGAGTCTGGGAAAATCAGTCAC 

9722 TGGAAAAATCTCTAGCAG 
ACCTTTTTAGAGATCGTC 
U3→

-453 CTGGAAGGGCTAATTTGGTCCCAAAGAAGACAAGAGATCCTTGATCTGTGGATCTACCAC
ACACAAGGCTACTTCCCTGATTGGCAGAATTACACACCAGGGCCAGGGATCAGATATCCA 

-333 CTGACCTTTGGATGGTGCTTCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCC 
AATGAAGGAGAGAACAACAGCTTGTTACACCCTATGAGCCTGCATGGGATGGAGGACGCG 

-214 GAGAAAGAAGTGTTAGTGTGGAGGTTTGACAGCAAACTAGCATTTCATCACATGGCCCGA 

GAGCTGCATCCGGAGTACTACAAAGACTGCTGACATCGAGCTTTCTACAAGGGACTTTCCG T 

-93 CTGGGGACTTTCCAGGGAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGTCCCTCAGATG 

CTGCATATAAGCAGCTGCTTTTTGCCTGTACTG GGTCTCTCTGGTTAGACCAGATCTGAG R
28 CCTGGGAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTT 
R U5-*~ 

GAGTGCTTCA AGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTCA 

←U5

148 GACCCTTTTAGTCAGTGTGGAAAAATCTCTAGCAG TGGCGCCCGAACAGGGACGCGAAA 
GCGAAAGTAGAACCAGAGGAGCTCTCTCGACGCAGGACTCGGCTTGCTGAAGCGCGCACAG

268 CAAGAGGCGAGGGGCGGCGACTGGTGAGTACGCCAATTTTTGACTAGCGGAGGCTAGAAG

MetGlyAlaArgAlaSerValLeuSerGlyGlyGluLeuAspLysTrpGlu 17 
GAGAGAGAGATGGGTGCGAGAGCGTCGGTATTAAGCGGGGGAGAATTAGATAAATGGGAA 

LysIleArgLeuArgProGlyGlyLysLysLysTyrLysLeuLysHisIleValTrpAla
388 AAAATTCGGTTAAGGCCAGGGGGAAAGAAAAAATATAAGTTAAAACATATAGTATGGGCA

SerArgGluLeuGluArgPheAlaValAsnProGlyLeuLeuGluThrSerGluGlyCys 57 
AGCAGGGAGCTAGAACGATTCGCAGTCAATCCTGGCCTGTTAGAAACATCAGAAGGCTGC 

ArgGlnIleLeuGlyGlnLeuGlnProSerLeuGlnThrGlySerGluGluLeuArgSer
508 AGACAAATATTGGGACAGCTACAGCCATCCCTTCAGACAGGATCAGAAGAACTTAGATCA 

LeuTyrAsnThrValAlaThrLeuTyrCysValHisGlnArgIleAspValLysAspThr 97
TTATATAATACAGTAGCAACCCTCTATTGTGTACATCAAAGGATAGATGTAAAAGACACC 

LysGluAlaLeuGluLysIleGluGluGluGlnAsnLysSerLysLysLysAlaGlnGln
628 AAGGAAGCTTTAGAGAAGATAGAGGAAGAGCAAAACAAAAGTAAGAAAAAGGCACAGCAA 

AlaAlaAlaAlaAlaGlyThrGlyAsnSerSerGlnValSerGlnAsnTyrProIleVal 137 
GCAGCAGCTGCAGCTGGCACAGGAAACAGCAGCCAGGTCAGCCAAAATTACCCTATAGTG 

GlnAsnLeuGlnGlyGlnMetValHisGlnAlaIleSerProArgThrLeuAsnAlaTrp
748 CAGAACCTACAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGG

ValLysValValGluGluLysAlaPheSerProGluValIleProMetPheSerAlaLeu 177
GTAAAAGTAGTAGAAGAAAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTA 

SerGluGlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlyHisGln 
868 TCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAA 
AlaAlaMetGlnMetLeuLysGluThrIleAsnGluGluAlaAlaGluTrpAspArgVal 217 n
GCAGCCATGCAAATGTTAAAAGAGACTATCAATGAGGAAGCTGCAGAATGGGATAGAGTG 

HisProValHisAlaGlyProIleAlaProGlyGlnMetArgGluProArgGlySerAsp
988 CATCCAGTGCATGCAGGGCCTATTGCACCAGGCCAAATGAGAGAACCAAGGGGAAGTGAC A 

IleAlaGlyThrThrSerThrLeuGlnGluGlnIleGlyTrpMetThrAsnAsnProPro 257
ATAGCAGGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCT 

IleProValGlyGluIleTyrLysArgTrpIleIleLeuGlyLeuAsnLysIleValArg ^
1108 ATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGA

MetTyrSerProThrSerIleLeuAspIleArgGlnGlyProLysGluProPheArgAsp 297
ATGTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGAT 

TyrValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnAspValLysAsn 
1228 TATGTAGACCGGTTCTATAAAACTCTAAGAGCCGAACAAGCTTCACAGGATGTAAAAAAT 

TrpMetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrIleLeuLys 337
TGGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATTGTAAGACTATTTTAAAA 

AlaLeuGlyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGly 
1348 GCATTGGGACCAGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGA

ProGlyHisLysAlaArgValLeuAlaGluAlaMetSerGlnValThrAsnProAlaAsn 377
CCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCCATGAGCCAAGTAACAAATCCAGCTAAC

IleMetMetGlnArgGlyAsnPheArgAsnGlnArgLysThrValLysCysPheAsnCys
1468 ATAATGATGCAGAGAGGCAATTTTAGGAACCAAAGAAAGACTGTTAAGTGTTTCAATTGT

GlyLysGluGlyHisIleAlaLysAsnCysArgAlaProArgLysLysGlyCysTrpArg 417
GGCAAAGAAGGGCACATAGCCAAAAATTGCAGGGCCCCTAGGAAAAAGGGCTGTTGGAGA 

CysGlyArgGluGlyHisGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGly 

PhePheArgG 

1588 TGTGGAAGGGAAGGACACCAAATGAAAGATTGCACTGAGAGACAGGCTAATTTTTTAGGG

LysIleTrpProSerTyrLysGlyArgProGlyAsnPheLeuGlnSerArgProGluPro 457 
luAspLeuAlaPheLeuGlnGlyLysAlaArgGluPheSerSerGluGlnThrArgAla 23
AAGATCTGGCCTTCCTACAAGGGAAGGCCAGGGAATTTTCTTCAGAGCAGACCAGAGCCA 

ThrAlaProProGluGluSerPheArgPheGlyGluGluLysThrThrProSerGlnLys
AsnSerProThrArgArgGluLeuGlnValTrpGlyGlyGluAsnAsnSerLeuSerGluA D
1708 ACAGCCCCACCAGAAGAGAGCTTCAGGTTTGGGGAGGAGAAAACAACTCCCTCTCAGAAG p 

GlnGluProIleAspLysGluLeuTyrProLeuThrSerLeuArgSerLeuPheGlyAsn 497
laGlyAlaAspArgGlnGlyThrValSerPheAsnPheProGlnIleThrLeuTrpGln 63
CAGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAAC 0 

AspProSerSerGlnOC 
ArgProLeuValThrIleArgIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyA
1828 GACCCCTCGTCACAATAAGGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAG |_ 

laAspAspThrValLeuGluGluMetAsnLeuProGlyLysTrpLysProLysMetIle 103
CAGATGATACAGTATTAGAAGAAATGAATTTGCCAGGAAAATGGAAACCAAAAATGATAG 

GlyGlyIleGlyGlyPheIleLysValArgGlnTyrAspGlnIleProValGluIleCysG
1948 GGGGAATTGGAGGTTTTATCAAAGTAAGACAGTACGATCAGATACCTGTAGAAATCTGTG 
lyHisLysAlaIleGlyThrValLeuValGlyProThrProValAsnIleIleGlyArg 143
GACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAA 

AsnLeuLeuThrGlnIleGlyCysThrLeuAsnPheProIleSerProIleGluThrValP
2068 ATCTGTTGACTCAGATTGGTTGTACTTTAAATTTCCCCATTAGTCCTATTGAAACTGTAC 

roValLysLeuLysProGlyMetAspGlyProLysValLysGlnTrpProLeuThrGlu 183
CAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAGCAATGGCCATTGACAGAAG 

GluLysIleLysAlaLeuValGluIleCysThrGluMetGluLysGluGlyLysIleSerL
2188 AAAAAATAAAAGCATTAGTAGAGATATGTACAGAAATGGAAAAGGAAGGGAAAATTTCAA 

ysIleGlyProGluAsnProTyrAsnThrProValPheAlaIleLysLysLysAspSer 223
AAATTGGGCCTGAAAATCCATACAATACTCCAGTATTTGCTATAAAGAAAAAAGACAGTA 

ThrLysTrpArgLysLeuValAspPheArgGluLeuAsnLysArgThrGlnAspPheTrpG
2308 CTAAATGGAGAAAACTAGTAGATTTCAGAGAACTTAATAAAAGAACTCAAGACTTCTGGG 

luValGlnLeuGlyIleProHisProAlaGlyLeuLysLysLysLysSerValThrVal 263
AAGTTCAGTTAGGAATACCACACCCCGCAGGGTTAAAAAAGAAAAAATCAGTAACAGTAT 

LeuAspValGlyAspAlaTyrPheSerValProLeuAspLysAspPheArgLysTyrThrA 
2428 TGGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATAAAGACTTTAGAAAGTATACTG

laPheThrIleProSerIleAsnAsnGluThrProGlyIleArgTyrGlnTyrAsnVal 303
CATTTACCATACCTAGTATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTGC 

LeuProGlnGlyTrpLysGlySerProAlaIlePheGlnSerSerMetThrLysIleLeuG
2548 TGCCACAGGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGACAAAAATCTTAG 

luProPheArgLysGlnAsnProAspIleValIleTyrGlnTyrMetAspAspLeuTyr 343
AGCCTTTTAGAAAACAGAATCCAGACATAGTTATCTATCAATACATGGATGATTTGTATG 

ValGlySerAspLeuGluIleGlyGlnHisArgThrLysIleGluGluLeuArgGlnHisL
2668 TAGGATCTGACTTAGAAATAGGGCAGCATAGAACAAAAATAGAGGAACTGAGACAGCATC

euLeuArgTrpGlyPheThrThrProAspLysLysHisGlnLysGluProProPheLeu 383 
TGTTGAGGTGGGGATTTACCACACCAGACAAAAAACATCAGAAAGAACCTCCATTCCTTT

TrpMetGlyTyrGluLeuHisProAspLysTrpThrValGlnProIleMetLeuProGluL
2788 GGATGGGTTATGAACTCCATCCTGATAAATGGACAGTACAGCCTATAATGCTGCCAGAAA 

ysAspSerTrpThrValAsnAspIleGlnLysLeuValGlyLysLeuAsnTrpAlaSer 423 
AAGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGAAAATTGAATTGGGCAAGTC

GlnIleTyrAlaGlyIleLysValLysGlnLeuCysLysLeuLeuArgGlyThrLysAlaL
2908 AGATTTATGCAGGGATTAAAGTAAAGCAGTTATGTAAACTCCTTAGAGGAACCAAAGCAC 

euThrGluValIleProLeuThrGluGluAlaGluLeuGluLeuAlaGluAsnArgGlu 463
TAACAGAAGTAATACCACTAACAGAAGAAGCAGAGCTAGAACTGGCAGAAAACAGGGAGA

IleLeuLysGluProValHisGluValTyrTyrAspProSerLysAspLeuValAlaGluI 
3028 TTCTAAAAGAACCAGTACATGAAGTATATTATGACCCATCAAAAGACTTAGTAGCAGAAA 

leGlnLysGlnGlyGlnGlyGlnTrpThrTyrGlnIleTyrGlnGluProPheLysAsn 503
TACAGAAGCAGGGGCAAGGCCAATGGACATATCAAATTTATCAAGAGCCATTTAAAAATC 

LeuLysThrGlyLysTyrAlaArgMetArgGlyAlaHisThrAsnAspValLysGlnLeuT
3148 TGAAAACAGGAAAGTATGCAAGGATGAGGGGTGCCCACACTAATGATGTAAAACAGTTAA 

hrGluAlaValGlnLysValSerThrGluSerIleValIleTrpGlyLysIleProLys 543
CAGAGGCAGTGCAAAAAGTATCCACAGAAAGCATAGTAATATGGGGAAAGATTCCTAAAT 
PheLysLeuProIleGlnLysGluThrTrpGluAlaTrpTrpMetGluTyrTrpGlnAlaT
3268 TTAAACTACCCATACAAAAGGAAACATGGGAAGCATGGTGGATGGAGTATTGGCAAGCTA

hrTrpIleProGluTrpGluPheValAsnThrProProLeuValLysLeuTrpTyrGln 583
CCTGGATTCCTGAGTGGGAGTTTGTCAATACCCCTCCCTTAGTGAAATTATGGTACCAGT 

LeuGluLysGluProIleValGlyAlaGluThrPheTyrValAspGlyAlaAlaAsnArgG 
3388 TAGAGAAAGAACCCATAGTAGGAGCAGAAACTTTCTATGTAGATGGGGCAGCTAATAGGG 

luThrLysLeuGlyLysAlaGlyTyrValThrAspArgGlyArgGlnLysValValSer 623
AGACTAAATTAGGAAAAGCAGGATATGTTACTGACAGAGGAAGACAAAAAGTTGTCTCCA 

IleAlaAspThrThrAsnGlnLysThrGluLeuGlnAlaIleHisLeuAlaLeuGlnAspS
3508 TAGCTGACACAACAAATCAGAAGACTGAATTACAAGCAATTCATCTAGCTTTGCAGGATT 

erGlyLeuGluValAsnIleValThrAspSerGlnTyrAlaLeuGlyIleIleGlnAla 663
CGGGATTAGAAGTAAACATAGTAACAGACTCACAATATGCATTAGGAATCATTCAAGCAC 

GlnProAspLysSerGluSerGluLeuValSerGlnIleIleGluGlnLeuIleLysLysG
3628 AACCAGATAAGAGTGAATCAGAGTTAGTCAGTCAAATAATAGAGCAGTTAATAAAAAAGG 

luLysValTyrLeuAlaTrpValProAlaHisLysGlyIleGlyGlyAsnGluGlnVal 703
AAAAGGTCTACCTGGCATGGGTACCAGCACACAAAGGAATTGGAGGAAATGAACAAGTAG 

AspLysLeuValSerAlaGlyIleArgLysValLeuPheLeuAsnGlyIleAspLysAlaG
3748 ATAAATTAGTCAGTGCTGGAATCAGGAAAGTACTATTTTTGAATGGAATAGATAAGGCCC 

lnGluGluHisGluLysTyrHisSerAsnTrpArgAlaMetAlaSerAspPheAsnLeu 743
AAGAAGAACATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAGTGATTTTAACCTGC 

ProProValValAlaLysGluIleValAlaSerCysAspLysCysGlnLeuLysGlyGluA 
3868 CACCTGTAGTAGCAAAAGAAATAGTAGCCAGCTGTGATAAATGTCAGCTAAAAGGAGAAG 

laMetHisGlyGlnValAspCysSerProGlyIleTrpGlnLeuAspCysThrHisLeu 783
CCATGCATGGACAAGTAGACTGTAGTCCAGGAATATGGCAACTAGATTGTACACATCTAG 

GluGlyLysIleIleLeuValAlaValHisValAlaSerGlyTyrIleGluAlaGluValI

3988 AAGGAAAAATTATCCTGGTAGCAGTTCATGTAGCCAGTGGATATATAGAAGCAGAAGTTA

leProAlaGluThrGlyGlnGluThrAlaTyrPheLeuLeuLysLeuAlaGlyArgTrp 823
TTCCAGCAGAGACAGGGCAGGAAACAGCATATTTTCTCTTAAAATTAGCAGGAAGATGGC 

ProValLysThrIleHisThrAspAsnGlySerAsnPheThrSerThrThrValLysAlaA
4108 CAGTAAAAACAATACATACAGACAATGGCAGCAATTTCACCAGTACTACGGTTAAGGCCG

laCysTrpTrpAlaGlyIleLysGlnGluPheGlyIleProTyrAsnProGlnSerGln 863
CCTGTTGGTGGGCAGGGATCAAGCAGGAATTTGGCATTCCCTACAATCCCCAAAGTCAAG 

GlyValValGluSerMetAsnAsnGluLeuLysLysIleIleGlyGlnValArgAspGlnA
4228 GAGTAGTAGAATCTATGAATAATGAATTAAAGAAAATTATAGGACAGGTAAGAGATCAGG

laGluHisLeuLysThrAlaValGlnMetAlaValPheIleHisAsnPheLysArgLys 903
CTGAACACCTTAAGACAGCAGTACAAATGGCAGTATTCATCCACAATTTTAAAAGAAAAG 

GlyGlyIleGlyGlyTyrSerAlaGlyGluArgIleValAspIleIleAlaThrAspIleG
4348 GGGGGATTGGGGGATACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGACATAC

lnThrLysGluLeuGlnLysGlnIleThrLysIleGlnAsnPheArgValTyrTyrArg 943
AAACTAAAGAACTACAAAAGCAAATTACAAAAATTCAAAATTTTCGGGTTTATTACAGGG 
AspAsnLysAspProLeuTrpLysGlyProAlaLysLeuLeuTrpLysGlyGluGlyAlaV
4468 ACAACAAAGATCCCCTTTGGAAAGGACCAGCAAAGCTTCTCTGGAAAGGTGAAGGGGCAG 

alValIleGlnAspAsnSerAspIleLysValValProArgArgLysAlaLysIleIle 983
TAGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAGAAAAGCAAAAATCATTA 

ArgAspTyrGlyLysGlnMetAlaGlyAspAspCysValAlaSerArgGlnAspGluAspA 
4588 GGGATTATGGAAAACAGATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGATT 

M 

AGAACATGGAAAAGTTTAGTAAAACACCATATGTATATTTCAAAGAAAGCTAAAGGATGG 
4708 TTTTATAGACATCACTATGAAAGTACTCATCCAAGAGTAAGTTCAGAAGTACACATCCCC 

CTAGGGGATGCTAAATTGGTAATAACAACATATTGGGGTCTGCATACAGGAGAAAGAGAA 
4828 TGGCATTTGGGCCAGGGAGTCGCCATAGAATGGAGGAAAAAGAAATATAGCACACAAGTA 

GACCCTGGCCTAGCAGACCAACTAATTCATCTGCATTATTTTGATTGTTTTTCAGAATCT 
4948 GCTATAAAAAATGCCATATTAGGATATAGAGTTAGTCCTAGGTGTGAATATCAAGCAGGA 

CATAACAAGGTAGGATCTCTACAATACTTGGCACTAGCAGCATTAATAACACCAAAAAAG 
5068 ACAAAGCCACCTTTGCCTAGTGTTAAGAAACTGACAGAGGATAGATGGAACAAGCCCCAG 

AAGACCAAGGGCCACAGAGGGAGCCATACAATGAATGGACACTAGAGCTTTTAGAGGAGC 
5188 TTAAGAGAGAAGCTGTTAGACATTTTCCTAGGCCATGGCTCCATAGCTTAGGACAATATA 

TCTATGAAACTTATGGGGATACTTGGGCAGGAGTGGAAGCCATAATAAGAATTCTGCAAC 
5308 AACTGCTGTTTATTCATTTCAGAATTGGGTGTCAACATAGCAGAATAGGCATTATTCAAC 

AGAGGAGAGCAAGAAGAAATGGAGCCAGTAGATCCTAATCTAGAGCCCTGGAAGCATCCA 
5428 GGAAGTCAGCCTAGGACTGCTTGTAACAATTGCTATTGTAAAAAGTGTTGCTTTCATTGC 

TACGCGTGTTTCACAAGAAAAGGCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAG
5548 CGACGAAGAGCTCCTCAGGACAGTCAGACTCATCAAGCTTCTCTATCAAAGCAGTAAGTA 

GTAAATGTAATGCAATCTTTACAAATATTAGCAATAGTATCATTAGTAGTAGTAGCAATA 

5668 ATAGCAATAGTTGTGTGGACCATAGTACTCATAGAATATAGGAAAATATTAAGACAAAGA

MetLysVal 

AAATAGACAGATTAATTGATAGAATAAGAGAAAAAGCAGAAGACAGTGGCAATGAAAGTG 



5788 



5908 




TrpLysGluAlaThrThrThrLeuPheCysAlaSerAspAlaArgAlaTyrAspThrGlu 
TGGAAAGAAGCAACTACCACTCTATTTTGTGCATCAGATGCTAGAGCATATGATACAGAG 
ValLeuGlyAsnValThrGluAsnPheAsnMetTrpLysAsnAsnMetValGluGlnMet
6028 GTATTGGGAAATGTGACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAGATG 

GlnGluAspIleIleSerLeuTrpAspGlnSerLeuLysProCysValLysLeuThrPro 123
CAGGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACCCCA 

LeuCysValThrLeuAsnCysThrAspLeuGlyLysAlaThrAsnThrAsnSerSerAsn 
6148 CTCTGTGTTACTTTAAATTGCACTGATTTGGGGAAGGCTACTAATACCAATAGTAGTAAT 

TrpLysGluGluIleLysGlyGluIleLysAsnCysSerPheAsnIleThrThrSerIle 163
TGGAAAGAAGAAATAAAAGGAGAAATAAAAAACTGCTCTTTCAATATCACCACAAGCATA 

ArgAspLysIleGlnLysGluAsnAlaLeuPheArgAsnLeuAspValValProIleAsp
6268 AGAGATAAGATTCAGAAAGAAAATGCACTTTTTCGTAACCTTGATGTAGTACCAATAGAT

AsnAlaSerThrThrThrAsnTyrThrAsnTyrArgLeuIleHisCysAsnArgSerVal 203 
AATGCTAGTACTACTACCAACTATACCAACTATAGGTTGATACATTGTAACAGATCAGTC 

IleThrGlnAlaCysProLysValSerPheGluProIleProIleHisTyrCysThrPro
6388 ATTACACAGGCCTGTCCAAAGGTATCATTTGAGCCAATTCCCATACATTATTGTACCCCG 

AlaGlyPheAlaIleLeuLysCysAsnAsnLysThrPheAsnGlyLysGlyProCysThr 243 E
GCTGGTTTTGCGATTCTAAAGTGTAATAATAAAACGTTCAATGGAAAAGGACCATGTACA 

AsnValSerThrValGlnCysThrHisGlyIleArgProIleValSerThrGlnLeuLeu
6508 AATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAATAGTGTCAACTCAACTGCTG 

LeuAsnGlySerLeuAlaGluGluGluValValIleArgSerAspAsnPheThrAsnAsn 283 N
TTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGACAATTTCACGAACAAT 

AlaLysThrIleIleValGlnLeuAsnGluSerValAlaIleAsnCysThrArgProAsn
6628 GCTAAAACCATAATAGTACAGCTGAATGAATCTGTAGCAATTAACTGTACAAGACCCAAC 

AsnAsnThrArgLysSerIleTyrIleGlyProGlyArgAlaPheHisThrThrGlyArg 323 V
AACAATACAAGAAAAAGTATCTATATAGGACCAGGGAGAGCATTTCATACAACAGGAAGA 

IleIleGlyAspIleArgLysAlaHisCysAsnIleSerArgAlaGlnTrpAsnAsnThr
6748 ATAATAGGAGATATAAGAAAAGCACATTGTAACATTAGTAGAGCACAATGGAATAACACT 

LeuGluGlnIleValLysLysLeuArgGluGlnPheGlyAsnAsnLysThrIleValPhe 363
TTAGAACAGATAGTTAAAAAATTAAGAGAACAGTTTGGGAATAATAAAACAATAGTCTTT 

AsnGlnSerSerGlyGlyAspProGluIleValMetHisSerPheAsnCysArgGlyGlu

6868 AATCAATCCTCAGGAGGGGACCCAGAAATTGTAATGCACAGTTTTAATTGTAGAGGGGAA

PhePheTyrCysAsnThrThrGlnLeuPheAsnAsnThrTrpArgLeuAsnHisThrGlu 403 
TTTTTCTACTGTAATACAACACAACTGTTTAATAATACATGGAGGTTAAATCACACTGAA 

GlyThrLysGlyAsnAspThrIleIleLeuProCysArgIleLysGlnIleIleAsnMet
6988 GGAACTAAAGGAAATGACACAATCATACTCCCATGTAGAATAAAACAAATTATAAACATG 

TrpGlnGluValGlyLysAlaMetTyrAlaProProIleGlyGlyGlnIleSerCysSer 443
TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATTGGAGGACAAATTAGTTGTTCA 

SerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyThrAsnValThrAsnAspThr
7108 TCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTACAAATGTAACTAATGACACC 

GluValPheArgProGlyGlyGlyAspMetArgAspAsnTrpArgSerGluLeuTyrLys 483 
GAGGTCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TyrLysValIleLysIleGluProLeuGlyIleAlaProThrLysAlaLysArgArgVal
7228 TATAAAGTAATAAAAATTGAACCATTAGGAATAGCACCCACCAAGGCAAAGAGAAGAGTG

ValGlnArgGluLysArgAlaValGlyIleValGlyAlaMetPheLeuGlyPheLeuGly 523
GTGCAGAGAGAAAAAAGAGCAGTGGGAATAGTAGGAGCTATGTTCCTTGGGTTCTTGGGA

AlaAlaGlySerThrMetGlyAlaValSerLeuThrLeuThrValGlnAlaArgGlnLeu
7348 GCAGCAGGAAGCACTATGGGCGCAGTGTCATTGACGCTGACGGTACAGGCCAGACAATTA

LeuSerGlyIleValGlnGlnGlnAsnAsnLeuLeuArgAlaIleGluAlaGlnGlnHis 563
TTGTCTGGTATAGTGCAACAGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAACAT

LeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGlnAlaArgValLeuAlaValGlu
7468 CTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAGTCCTGGCTGTGGAA

ArgTyrLeuArgAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGlyLysLeuIleCys 603
AGATACCTAAGGGATCAACAGCTCCTAGGGATTTGGGGTTGCTCTGGAAAACTCATTTGC 

ThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluAspIleTrpAsp
7588 ACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAAGACATTTGGGAT

AsnMetThrTrpMetGlnTrpGluArgGluIleAspAsnTyrThrAsnThrIleTyrThr 643
AACATGACCTGGATGCAGTGGGAAAGAGAAATTGACAATTACACAAACACAATATACACC 

LeuLeuGluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAsp 

7708 TTACTTGAAGAATCGCAGAACCAACAAGAAAAGAATGAACAAGAATTATTAGAATTGGAT 

LysTrpAlaSerLeuTrpAsnTrpPheSerIleThrAsnTrpLeuTrpTyrIleLysIle 683
AAGTGGGCAAGTTTGTGGAATTGGTTTAGCATAACAAACTGGCTGTGGTATATAAAGATA

PheIleMetIleValGlyGlyLeuValGlyLeuArgIleValPheAlaValLeuSerIle E

7828 TTCATAATGATAGTAGGAGGCTTGGTAGGTTTAAGAATAGTTTTTGCTGTGCTTTCTATA 

ValAsnArgValArgGlnGlyTyrSerProLeuSerPheGlnThrArgLeuProValPro 723 
GTGAATAGAGTTAGGCAGGGATACTCACCATTGTCATTTCAGACCCGCCTCCCAGTCCCG N 

ArgGlyProAspArgProAspGlyIleGluGluGluGlyGlyGluArgAspArgAspArg
7948 AGGGGACCCGACAGGCCCGACGGAATCGAAGAAGAAGGTGGAGAGAGAGACAGAGACAGA 

SerValArgLeuValAspGlyPheLeuAlaLeuIleTrpGluAspLeuArgSerLeuCys 763 V
TCCGTTCGATTAGTGGATGGATTCTTAGCACTTATCTGGGAAGATCTGCGGAGCCTGTGC

LeuPheSerTyrArgArgLeuArgAspLeuLeuLeuIleAlaAlaArgThrValGluIle 
8068 CTCTTCAGCTACCGCCGCTTGAGAGACTTACTCTTGATTGCAGCGAGGACTGTGGAAATT

LeuGlyHisArgGlyTrpGluAlaLeuLysTyrTrpTrpSerLeuLeuGlnTyrTrpIle 803 
CTGGGGCACAGGGGGTGGGAAGCCCTCAAATATTGGTGGAGTCTCCTGCAGTATTGGATT 

GlnGlyLeuLysAsnSerAlaValSerTrpLeuAsnAlaThrAlaIleAlaValThrGlu
8188 CAGGAACTAAAGAATAGTGCTGTTAGCTGGCTCAACGCCACAGCTATAGCAGTAACTGAG 

GlyThrAspArgValIleGluValAlaGlnArgAlaTyrArgAlaIleLeuHisIleHis 843
GGGACAGATAGGGTTATAGAAGTAGCACAAAGAGCTTATAGAGCTATTCTCCACATACAT 

ArgArgIleArgGlnGlyLeuGluArgLeuLeuLeuOC
8308 AGAAGAATTAGACAGGGCTTGGAAAGGCTTTTGCTATAAGATGGGTGGCAAGTGGTCAAA 

ACGTAGTATGGGTGGATGGTCTGCTATAAGGGAAAGAATGAGACGAGCTGAGCCACGAGC 
8428 TGAGCCAGCAGCAGATGGGGTGGGAGCAGTATCTCGAGACCTGGAAAAACATGGAGCAAT
CACAAGTAGCAATACAGCAGCTACTAATGCTGATTGTGCCTGGCTAGAAGCACAAGAGGA 
8548 GGAAGAGGTGGGTTTTCCAGTCAGACCTCAGGTACCTTTAAGACCAATGACTTACAAGGC 

AGCTTTAGATATTAGCCACTTTTTAAAAGAAAAGGGGGGA CTGGAAGGGCTAATTTGGT 
8667 CCCAAAGAAGACAAGAGATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTG 

ATTGGCAGAATTACACACCAGGGCCAGGGATCAGATATCCACTGACCTTTGGATGGTGCT 
8787 TCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCCAATGAAGGAGAGAACAACA 

GCTTGTTACACCCTATGAGCCTGCATGGGATGGAGGACGCGGAGAAAGAAGTGTTAGTGT 
8907 GGAGGTTTGACAGCAAACTAGCATTTCATCACATGGCCCGAGAGCTGCATCCGGAGTACT 

ACAAAGACTGCTGACATCGAGCTTTCTACAAGGGACTTTCCGCTGGGGACTTTCCAGGGA 
9027 GGCGTGGCCTGGGCGGGACTGGGGAGTGGCGTCCCTCAGATGCTGCATATAAGCAGCTGC 

TTTTTGCCTGTACTG GGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGC 

←R U5→

9146 TAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCTTCA AGTAGT 
GTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTCAGACCCTTTTAGTCAGT 
←U5

9265 GTGGAAAAATCTCTAGCAG 
ptac 5 Promoter



MetIleVal
ATGATCGTA 



GlnAsnLeuGlnGlyGlnMetValHisGlnAlaIleSerProArgThrLeuAsnAlaTrp
748 CAGAATCTGCAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGG

ValLysValValGluGluLysAlaPheSerProGluValIleProMetPheSerAlaLeu 181
GTAAAAGTAGTAGAAGAAAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTA 

SerGluGlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlyHisGln 
868 TCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAA

AlaAlaMetGlnMetLeuLysGluThrIleAsnGluGluAlaAlaGluTrpAspArgVal 221
GCAGCCATGCAAATGTTAAAAGAGACTATCAATGAGGAAGCTGCAGAATGGGATAGAGTG

HisProValHisAlaGlyProIleAlaProGlyGlnMetArgGluProArgGlySerAsp
988 CATCCAGTGCATG CAGGGCCTATTGCACCAGGCCAAATGAGAGAACCAAGGGGAAGTGAC 

IleAlaGlyThrThrSerThrLeuGlnGluGlnIleGlyTrpMetThrAsnAsnProPro 261
ATAGCAGGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCT 

IleProValGlyGluIleTyrLysArgTrpIleIleLeuGlyLeuAsnLysIleValArg
1108 ATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGA 

MetTyrSerProThrSerIleLeuAspIleArgGlnGlyProLysGluProPheArgAsp 301
ATGTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGAT 

TyrValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnAspValLysAsn 
1228 TATGTAGACCGGTTCTATAAAACTCTAAGAGCCGAACAAGCTTCACAGGATGTAAAAAAT

TrpMetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrIleLeuLys 341
TGGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATTGTAAGACTATTTTAAAA 

AlaLeuGlyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGly
1348 GCATTGGGACCAGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGA 

ProGlyHisLysAlaArgValLeu Stop Stop 



CCCGGGCATAAAGCAAGAGTTTTGTGATAG Ptac 5 
ptac 5 Promoter



MetIleVal 141
ATGATCGTA 



GlnAsnLeuGlnGlyGlnMetValHisGlnAlaIleSerProArgThrLeuAsnAlaTrp
748 CAGAATCTG CAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGG

ValLysValValGluGluLysAlaPheSerProGluValIleProMetPheSerAlaLeu 181
GTAAAAGTAGTAGAAGAAAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTA 

SerGluGlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlyHisGln 
868 TCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAA 

AlaAlaMetGlnMetLeuLysGluThrIleAsnGluGluAlaAlaGluTrpAspArgVal 221

GCAGCCATGCAAATGTTAAAAGAGACTATCAATGAGGAAGCTGCAGAATGGGATAGAGTG 

HisProValHisAlaGlyProIleAlaProGlyGlnMetArgGluProArgGlySerAsp 
988 CATCCAGTGCATGCAGGGCCTATTGCACCAGGCCAAATGAGAGAACCAAGGGGAAGTGAC 

IleAlaGlyThrThrSerThrLeuGlnGluGlnIleGlyTrpMetThrAsnAsnProPro 261
ATAGCAGGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCT 

IleProValGlyGluIleTyrLysArgTrpIleIleLeuGlyLeuAsnLysIleValArg
1108 ATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGA 

MetTyrSerProThrSerIleLeuAspIleArgGlnGlyProLysGluProPheArgAsp 301
ATGTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGAT 

TyrValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnAspValLysAsn 
1228 TATGTAGACCGGTTCTATAAAACTCTAAGAGCCGAACAAGCTTCACAGGATGTAAAAAAT 

TrpMetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrIleLeuLys 341
TGGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATTGTAAGACTATTTTAAAA 

AlaLeuGlyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGly 
1348 GCATTGGGACCAGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGA 



G 



ProGlyHisLysAlaArgValLeuAlaGluAlaMetSerGlnValThrAsnProAlaAsn 381 
CCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCCATGAGCCAAGTAACAAATCCAGCTAAC 

IleMetMetGlnArgGlyAsnPheArgAsnGlnArgLysThrValLysCysPheAsnCys
1468 ATAATGATGCAGAGAGGCAATTTTAGGAACCAAAGAAAGACTGTTAAGTGTTTCAATTGT 

GlyLysGluGlyHisIleAlaLysAsnCysArgAlaProArgLysLysGlyCysTrpArg 421 
GGCAAAGAAGGGCACATAGCCAAAAATTGCAGGGCCCCTAGGAAAAAGGGCTGTTGGAGA

CysGlyArgGluGlyHisGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGly 

PhePheArgG 

1588 TGTGGAAGGGAAGGACACCAAATGAAAGATTGCACTGAGAGACAGGCTAATTTTTTAGGG

LysIleTrpProSerTyrLysGlyArgProGlyAsnPheLeuGlnSerArgProGluPro 461
luAspLeuAlaPheLeuGlnGlyLysAlaArgGluPheSerSerGluGlnThrArgAla 23
AAGATCTGGCCTTCCTACAAGGGAAGGCCAGGGAATTTTCTTCAGAGCAGACCAGAGCCA 

ThrAlaProProGluGluSerPheArgPheGlyGluGluLysThrThrProSerGlnLys 
AsnSerProThrArgArgGluLeuGlnValTrpGlyGlyGluAsnAsnSerLeuSerGluA
1708 ACAGCCCCACCAGAAGAGAGCTTCAGGTTTGGGGAGGAGAAAACAACTCCCTCTCAGAAG 

GlnGluProIleAspLysGluLeuTyrProLeuThrSerLeuArgSerLeuPheGlyAsn 501
laGlyAlaAspArgGlnGlyThrValSerPheAsnPheProGlnIleThrLeuTrpGln 63
CAGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAAC 
AspProSerSerGlnOC

ArgProLeuValThrIleArgIleGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyA
1828 GACCCCTCGTCACAATAAGGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAG 

laAspAspThrValLeuGluGluMetAsnLeuProGlyLysTrpLysProLysMetIle 103
CAGATGATACAGTATTAGAAGAAATGAATTTGCCAGGAAAATGGAAACCAAAAATGATAG

GlyGlyIleGlyGlyPheIleLysValArgGlnTyrAspGlnIleProValGluIleCysG
1948 GGGGAATTGGAGGTTTTATCAAAGTAAGACAGTACGATCAGATACCTGTAGAAATCTGTG 

lyHisLysAlaIleGlyThrValLeuValGlyProThrProValAsnIleIleGlyArg 143
GACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAA 

AsnLeuLeuThrGlnIleGlyCysThrLeuAsnPheProIleSerProIleGluThrValP
2068 ATCTGTTGACTCAGATTGGTTGTACTTTAAATTTCCCCATTAGTCCTATTGAAACTGTAC 

roValLysLeuLysProGlyMetAspGlyProLysValLysGlnTrpProLeuThrGlu 183
CAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAGCAATGGCCATTGACAGAAG 

GluLysIleLysAlaLeuValGluIleCysThrGluMetGluLysGluGlyLysIleSerL
2188 AAAAAATAAAAGCATTAGTAGAGATATGTACAGAAATGGAAAAGGAAGGGAAAATTTCAA 

ysIleGlyProGluAsnProTyrAsnThrProValPheAlaIleLysLysLysAspSer 223
AAATTGGGCCTGAAAATCCATACAATACTCCAGTATTTGCTATAAAGAAAAAAGACAGTA 

ThrLysTrpArgLysLeuValAspPheArgGluLeuAsnLysArgThrGlnAspPheTrpG
2308 CTAAATGGAGAAAACTAGTAGATTTCAGAGAACTTAATAAAAGAACTCAAGACTTCTGGG

luValGlnLeuGlyIleProHisProAlaGlyLeuLysLysLysLysSerValThrVal 263
AAGTTCAGTTAGGAATACCACACCCCGCAGGGTTAAAAAAGAAAAAATCAGTAACAGTAT 

LeuAspValGlyAspAlaTyrPheSerValProLeuAspLysAspPheArgLysTyrThrA 
2428 TGGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATAAAGACTTTAGAAAGTATACTG

laPheThrIleProSerIleAsnAsnGluThrProGlyIleArgTyrGlnTyrAsnVal 303
CATTTACCATACCTAGTATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTGC 

LeuProGlnGlyTrpLysGlySerProAlaIlePheGlnSerSerMetThrLysIleLeuG
2548 TGCCACAGGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGACAAAAATCTTAG 

luProPheArgLysGlnAsnProAspIleValIleTyrGlnTyrMetAspAspLeuTyr 343
AGCCTTTTAGAAAACAGAATCCAGACATAGTTATCTATCAATACATGGATGATTTGTATG 

2668 TAGGATCTGACTTAGAAATAGGGCAGCATAGAACAAAAATAGAGGAACTGAGACAGCATC 

euLeuArgTrpGlyPheThrThrProAspLysLysHisGlnLysGluProProPheLeu 383 
TGTTGAGGTGGGGATTTACCACACCAGACAAAAAACATCAGAAAGAACCTCCATTCCTTT 

TrpMetGlyTyrGluLeuHisProAspLysTrpThrValGlnProIleMetLeuProGluL
2788 GGATGGGTTATGAACTCCATCCTGATAAATGGACAGTACAGCCTATAATGCTGCCAGAAA

ysAspSerTrpThrValAsnAspIleGlnLysLeuValGlyLysLeuAsnTrpAlaSer 423
AAGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGAAAATTGAATTGGGCAAGTC 

GlnIleTyrAlaGlyIleLysValLysGlnLeuCysLysLeuLeuArgGlyThrLysAlaL
2908 AGATTTATGCAGGGATTAAAGTAAAGCAGTTATGTAAACTCCTTAGAGGAACCAAAGCAC 

euThrGluValIleProLeuThrGluGluAlaGluLeuGluLeuAlaGluAsnArgGlu 463 P
TAACAGAAGTAATACCACTAACAGAAGAAGCAGAGCTAGAACTGGCAGAAAACAGGGAGA q 
IleLeuLysGluProValHisGluValTyrTyrAspProSerLysAspLeuValAlaGluI
3028 TTCTAAAAGAACCAGTACATGAAGTATATTATGACCCATCAAAAGACTTAGTAGCAGAAA 

leGlnLysGlnGlyGlnGlyGlnTrpThrTyrGlnlleTyrGlnGluProPheLysAsn 503 
TACAGAAGCAGGGGCAAGGCCAATGGACATATCAAATTTATCAAGAGCCATTTAAAAATC 

LeuLysThrGlyLysTyrAlaArgMetArgGlyAlaHisThrAsnAspValLysGlnLeuT 
3148 TGAAAACAGGAAAGTATGCAAGGATGAGGGGTGCCCACACTAATGATGTAAAACAGTJ 

hrGluAlaValGlnLysValSerThrGluSerIleValIleTrpGlyLysIleProLys 543
ptac b 
ARV GAG p16 - synthetic Parts A and B

5' arv 234 3' 

MetGlnArgGlyAsnPheArgAsnGlnArgLysThrValLysCysPheAsnCysGlyLys

tattatgcaaagaggtaacttcaggaatcaaagaaagaccgttaagtgtttcaactgtggtaag
ataatacgtttctccattgaagtccttagtttctttctggcaattcacaaagttgacaccattc

3' arv 235 5'

10 mnlI, 23 hinfI,

GluGlyHisIleAlaLysAsnCysArgAlaProArgLysLysAlaCysTrpArgCysGly 
63 GAAGGTCACATCGCTAAGAACTGTAGAGCTCCAAGAAAGAAGGCTTGTTGGAGATGTGGT 
CTTCCAGTGTAGCGATTCTTGACATCTCGAGGTTCTTTCTTCCGAACAACCTCTACACCA 

76 ddeI, 88 ban2 hgiA hgiJII sacI sduI, 89 aluI,

ArgGluGlyHisGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGlyLysIle 
123 AGAGAAGGTCACCAAATGAAGGACTGTACCGAAAGACAAGCTAACTTCTTGGGTAAGATC 
TCTCTTCCAGTGGTTTACTTCCTGACATGGCTTTCTGTTCGATTGAAGAACCCATTCTAG 

129 bstE2, 131 hph, 148 rsaI, 161 aluI, 178 bglII xho2, 179 sau3a,

TrpProSerTyrLysGlyArgProGlyAsnPheLeuGlnSerArgProGluProThrAla 
183 TGGCCATCTTACAAGGGTAGACCAGGTAACTTCTTGCAATCCAGACCAGAACCAACCGCT 
ACCGGTAGAATGTTCCCATCTGGTCCATTGAAGAACGTTAGGTCTGGTCTTGGTTGGCGA

183 balI cfrI haeI, 184 haeIII, 199 accI, 204 apyI ecorII scrFI,

ProProGluGluSerPheArgPheGlyGluGluLysThrThrProSerGlnLysGlnGlu
243 CCACCTGAAGAAAGTTTCAGGTTCGGTGAAGAAAAGACCACCCCATCTCAAAAGCAAGAA 
GGTGGACTTCTTTCAAAGTCCAAGCCACTTCTTTTCTGGTGGGGTAGAGTTTTCGTTCTT 

249 mboII, 267 hph, 270 mboII,

ProIleAspLysGluLeuTyrProLeuThrSerLeuArgSerLeuPheGlyAsnAspPro 
303 CCAATCGACAAGGAATTGTACCCATTGACCTCTTTGAGATCCTTGTTCGGTAACGATCCC 
GGTTAGCTGTTCCTTAACATGGGTAACTGGAGAAACTCTAGGAACAAGCCATTGCTAGGG 

307 taqI, 320 rsaI, 331 mnlI, 339 xho2, 340 sau3a, 357 sau3a,
361 mnlI, 362 avaI xhoI,

SerSerGlnOP AM 
363 TCGAGCCAATGATAG 

AGCTCGGTTACTATCAGCT 

363 taqI, 377 accI hindII salI
MetSer

PYK Promoter ATGTCT 



ArgIleAspCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProVal 51
AGAATCGAT GTAGTGCTACAGAAAAATTGTGGGTCACAGTTTATTATGGAGTACCTGTG

TrpLysGluAlaThrThrThrLeuPheCysAlaSerAspAlaArgAlaTyrAspThrGlu 
5908 TGGAAAGAAGCAACTACCACTCTATTTTGTGCATCAGATGCTAGAGCATATGATACAGAG 

ValHisAsnValTrpAlaThrHisAlaCysValProThrAspProAsnProGlnGluVal 91 
GTACATAATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTA 

ValLeuGlyAsnValThrGluAsnPheAsnMetTrpLysAsnAsnMetValGluGlnMet 
6028 GTATTGGGAAATGTGACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAGATG 

GlnGluAspIleIleSerLeuTrpAspGlnSerLeuLysProCysValLysLeuThrPro 131
CAGGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACCCCA 

LeuCysValThrLeuAsnCysThrAspLeuGlyLysAlaThrAsnThrAsnSerSerAsn 
6148 CTCTGTGTTACTTTAAATTGCACTGATTTGGGGAAGGCTACTAATACCAATAGTAGTAAT

TrpLysGluGluIleLysGlyGluIleLysAsnCysSerPheAsnIleThrThrSerIle 171
TGGAAAGAAGAAATAAAAGGAGAAATAAAAAACTGCTCTTTCAATATCACCACAAGCATA 

ArgAspLysIleGlnLysGluAsnAlaLeuPheArgAsnLeuAspValValProIleAsp
6268 AGAGATAAGATTCAGAAAGAAAATGCACTTTTTCGTAACCTTGATGTAGTACCAATAGAT

AsnAlaSerThrThrThrAsnTyrThrAsnTyrArgLeuIleHisCysAsnArgSerVal 211 
AATGCTAGTACTACTACCAACTATACCAACTATAGGTTGATACATTGTAACAGATCAGTC 

IleThrGlnAlaCysProLysValSerPheGluProIleProIleHisTyrCysThrPro 
6388 ATTACACAGGCCTGTCCAAAGGTATCATTTGAGCCAATTCCCATACATTATTGTACCCCG 

AlaGlyPheAlaIleLeuLysCysAsnAsnLysThrPheAsnGlyLysGlyProCysThr 251
GCTGGTTTTGCGATTCTAAAGTGTAATAATAAAACGTTCAATGGAAAAGGACCATGTACA 
AsnValSerThrValGlnCysThrHisGlyIleArgProIleValSerThrGlnLeuLeu
6508 AATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAATAGTGTCAACTCAACTGCTG

LeuAsnGlySerLeuAlaGluGluGluValValIleArgSerAspAsnPheThrAsnAsn 291
TTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGACAATTTCACGAACAAT 

AlaLysThrIleIleValGlnLeuAsnGluSerValAlaIleAsnCysThrArgProAsn
6628 GCTAAAACCATAATAGTACAGCTGAATGAATCTGTAGCAATTAACTGTACAAGACCCAAC

AsnAsnThrArgLysSerIleTyrIleGlyProGlyArgAlaPheHisThrThrGlyArg 331
AACAATACAAGAAAAAGTATCTATATAGGACCAGGGAGAGCATTTCATACAACAGGAAGA 

IleIleGlyAspIleArgLysAlaHisCysAsnIleSerArgAlaGlnTrpAsnAsnThr
6748 ATAATAGGAGATATAAGAAAAGCACATTGTAACATTAGTAGAGCACAATGGAATAACACT

LeuGluGlnIleValLysLysLeuArgGluGlnPheGlyAsnAsnLysThrIleValPhe 371
TTAGAACAGATAGTTAAAAAATTAAGAGAACAGTTTGGGAATAATAAAACAATAGTCTTT 

AsnGlnSerSerGlyGlyAspProGluIleValMetHisSerPheAsnCysArgGlyGlu
6868 AATCAATCCTCAGGAGGGGACCCAGAAATTGTAATGCACAGTTTTAATTGTAGAGGGGAA 

PhePheTyrCysAsnThrThrGlnLeuPheAsnAsnThrTrpArgLeuAsnHisThrGlu 411 
TTTTTCTACTGTAATACAACACAACTGTTTAATAATACATGGAGGTTAAATCACACTGAA 

GlyThrLysGlyAsnAspThrIleIleLeuProCysArgIleLysGlnIleIleAsnMet

6988 GGAACTAAAGGAAATGACACAATCATACTCCCATGTAGAATAAAACAAATTATAAACATG 

TrpGlnGluValGlyLysAlaMetTyrAlaProProIleGlyGlyGlnIleSerCysSer 451 N
TGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATTGGAGGACAAATTAGTTGTTCA 

SerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyThrAsnValThrAsnAspThr
7108 TCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTACAAATGTAACTAATGACACC 

GluValPheArgProGlyGlyGlyAspMetArgAspAsnTrpArgSerGluLeuTyrLys 491 V
GAGGTCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 

TyrLysValIleLysIleGluProAsnSerValSer
7228 TATAAAGTAATAAAAATTGAACCAAATTCGGTATCTTGA 



PYK Terminator 



FIG. 11B 






Nucleotide MetIleAspLysAlaGlnGluGluHisGluLysTyrHisSerAsnTrp

positions 1 AGGXAACAG: : : : ATGAT : GA : AAGGCACAAGAAGAACATGAGAAATATCACAGTAATTGG 
relative to TCCXTTGTC: : : :TACTA:CT:TTCCGTGTTCTTCTTGTACTCTTTATAGTGTCATTAACC 



Figure 5. 



32 mboII, 38 nlaIII,



ArgAlaMetAlaSerAspPheAsnLeuProProValValAlaLysGluIleValAlaSer 
3820 62 AGAGCCATGGCTAGTGATTTTAACCTGCCACCTGTAGTAGCAAAAGAAATAGTAGCCAGC 

TCTCGGTACCGATCACTAAAATTGGACGGTGGACATCATCGTTTTCTTTATCATCGGTCG 

66 ncoI, 67 nlaIII, 118 nspBII pvuII, 119 aluI,

CysAspLysCysGlnLeuLysGlyGluAlaMetHisGlyGlnValAspCysSerProGly
3880 122 TGTGATAAATGTCAGCTAAAAGGAGAAGCCATGCATGGACAAGTAGACTGTAGTCCAGGA 

ACACTATTTACAGTCGATTTTCCTCTTCGGTACGTACCTGTTCATCTGACATCAGGTCCT 

a a a A 

135 aluI, 151 nlaIII, 152 nsiI ava3, 155 nlaIII, 164 accI, 176 apyI bstXI ecorII scrFI,

IleTrpGlnLeuAspCysThrHisLeuGluGlyLysIleIleLeuValAlaValHisVal

3940 182 ATATGGCAACTAGATTGTACACATCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTA 

TATACCGTTGATCTAACATGTGTAGATCTTCCTTTTTAATAGGACCATCGTCAAGTACAT 

^ * a a 

198 rsaI, 205 xbaI, 223 apyI ecorII scrFI, 236 nlaIII,

AlaSerGlyTyrIleGluAlaGluValIleProAlaGluThrGlyGlnGluThrAlaTyr
4000 242 GCCAGTGGATATATAGAAGCAGAAGTTATTCCAGCAGAGACAGGGCAGGAAACAGCATAT 

CGGTCACCTATATATCTTCGTCTTCAATAAGGTCGTCTCTGTCCCGTCCTTTGTCGTATA

a 

263 xmnI,

PheLeuLeuLysLeuAlaGlyArgTrpProValLysThrIleHisThrAspAsnGlySer

4060 302 TTTCTCTTAAAATTAGCAGGAAGATGGCCAGTAAAAACAATACATACAGACAATGGCAGC 

AAAGAGAATTTTAATCGTCCTTCTACCGGTCATTTTTGTTATGTATGTCTGTTACCGTCG 

AAA A 

321 mboII, 326 balI cfrI haeI, 327 haeIII, 357 bbv fnu4hI,

AsnPheThrSerThrThrValLysAlaAlaCysTrpTrpAlaGlyIleLysGlnGluPhe

4120 362 AATTTCACCAGTACTACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCAGGAATTT 

TTAAAGTGGTCATGATGCCAATTCCGGCGGACAACCACCCGTCCCTAGTTCGTCCTTAAA

^ AA /\ A A A 

366 hph, 371 scaI, 372 rsaI, 385 haeIII, 386 fnu4hI nsbII, 405 binI, 406 dpnI sau3a,

GlyIleProTyrAsnProGlnSerGlnGlyValValGluSerMetAsnAsnGluLeuLys

4180 422 GGCATTCCCTACAATCCCCAAAGTCAAGGAGTAGTAGAATCTATGAATAATGAATTAAAG

CCGTAAGGGATGTTAGGGGTTTCAGTTCCTCATCATCTTAGATACTTATTACTTAATTTC 

A 

423 bsmI, 458 hinfI,

LysIleIleGlyGlnValArgAspGlnAlaGluHisLeuLysThrAlaValGlnMetAla

4240 482 AAAATTATAGGACAGGTAAGAGATCAGGCTGAACACCTTAAGACAGCAGTACAAATGGCA 

TTTTAATATCCTGTCCATTCTCTAGTCCGACTTGTGGAATTCTGTCGTCATGTTTACCGT 

A A A 

503 dpnI sau3a, 518 aflII, 530 rsaI,

ValPheIleHisAsnPheLysArgLysGlyGlyIleGlyGlyTyrSerAlaGlyGluArg

4300 542 GTATTCATCCACAATTTTAAAAGAAAAGGGGGGATTGGGGGATACAGTGCAGGGGAAAGA 

CATAAGTAGGTGTTAAAATTTTCTTTTCCCCCCTAACCCCCTATGTCACGTCCCCTTTCT 

547 fokI, 557 ahaIII,
IleValAspIleIleAlaThrAspIleGlnThrLysGluLeuGlnLysGlnIleThrLys
4360 602 ATAGTAGACATAATAGCAACAGACATACAAACTAAAGAACTACAAAAGCAAATTACAAAA 

TATCATCTGTATTATCGTTGTCTGTATGTTTGATTTCTTGATGTTTTCGTTTAATGTTTT 

605 accI,

IleGlnAsnPheArgValTyrTyrArgAspAsnLysAspProLeuTrpLysGlyProAla
4420 662 ATTCAAAATTTTCGGGTTTATTACAGGGACAACAAAGATCCCCTTTGGAAAGGACCAGCA 

TAAGTTTTAAAAGCCCAAATAATGTCCCTGTTGTTTCTAGGGGAAACCTTTCCTGGTCGT

697 xho2, 698 dpnI sau3a, 713 asuI ava2,

LysLeuLeuTrpLysGlyGluGlyAlaValValIleGlnAspAsnSerAspIleLysVal
4480 722 AAGCTTCTCTGGAAAGGTGAAGGGGCAGTAGTAATACAAGATAATAGTGACATAAAAGTA

TTCGAAGAGACCTTTCCACTTCCCCGTCATCATTATGTTCTATTATCACTGTATTTTCAT 

722 hindIII, 723 aluI, 737 hph,

ValProArgArgLysAlaLysIleIleArgAspTyrGlyLysGlnMetAlaGlyAspAsp
4540 782 GTGCCAAGAAGAAAAGCAAAAATCATTAGGGATTATGGAAAACAGATGGCAGGTGATGAT 

CACGGTTCTTCTTTTCGTTTTTAGTAATCCCTAATACCTTTTGTCTACCGTCCACTACTA 

789 mboII, 833 hph,

CysValAlaSerArgGlnAspGluAspAM 
4600 842 TGTGTGGCAAGTAGACAGGATGAGGATTAGTCGACGGAATTCTTTAGTAAAACACC 

ACACACCGTTCATCTGTCCTACTCCTAATCAGCTGCCTTAAGAAATCATTTTGTGG 



852 accI, 859 fokI, 863 mnlI, 871 accI hindII salI, 872 taqI, 878 ecorI.
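
The site annotations and the paired bottom strand in this figure can be cross-checked against the printed top strand. The sketch below is illustrative only: the helper names `complement` and `find_sites` are hypothetical, and it assumes that a listed coordinate marks the first base of the recognition sequence, which holds for the three sites tested here but is not stated anywhere in the listing.

```python
# Base-pairing table for building the bottom strand as printed
# (same left-to-right order as the top strand, not reversed).
PAIR = str.maketrans("ACGT", "TGCA")


def complement(top: str) -> str:
    """Return the bottom strand aligned under the top strand."""
    return top.upper().translate(PAIR)


def find_sites(top: str, site: str, first_position: int) -> list[int]:
    """Return the coordinates of every occurrence of `site` in `top`.

    `first_position` is the coordinate printed at the start of the line.
    Overlapping occurrences are reported.
    """
    hits = []
    i = top.find(site)
    while i != -1:
        hits.append(first_position + i)
        i = top.find(site, i + 1)
    return hits


# Top strands of the lines numbered 722 and 842 in Figure 5.
line_722 = "AAGCTTCTCTGGAAAGGTGAAGGGGCAGTAGTAATACAAGATAATAGTGACATAAAAGTA"
line_842 = "TGTGTGGCAAGTAGACAGGATGAGGATTAGTCGACGGAATTCTTTAGTAAAACACC"

print(find_sites(line_722, "AAGCTT", 722))  # hindIII recognition sequence
print(find_sites(line_842, "GTCGAC", 842))  # salI recognition sequence
print(find_sites(line_842, "GAATTC", 842))  # ecorI recognition sequence
print(complement(line_722))
```

A mismatch between `complement(top)` and the printed bottom strand is a quick indicator of a misread base in one of the two lines.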
SOD

MetAlaThrLysAlaValCysValLeuLysGlyAspGlyProValGlnGlyIleIleAsn
1 CATGGCGACGAAGGCCGTGTGCGTGCTGAAGGGCGACGGCCCAGTGCAGGGCATCATCAAT 
CGCTGCTTCCGGCACACGCACGACTTCCCGCTGCCGGGTCACGTCCCGTAGTAGTTA 

PheGluGlnLysGluSerAsnGlyProValLysValTrpGlySerIleLysGlyLeuThr
62 TTCGAGCAGAAGGAAAGTAATGGACCAGTGAAGGTGTGGGGAAGCATTAAAGGACTGACT 
AAGCTCGTCTTCCTTTCATTACCTGGTCACTTCCACACCCCTTCGTAATTTCCTGACTGA 

GluGlyLeuHisGlyPheHisValHisGluPheGlyAspAsnThrAlaGlyCysThrSer 
122 GAAGGCCTGCATGGATTCCATGTTCATGAGTTTGGAGATAATACAGCAGGCTGTACCAGT 
CTTCCGGACGTACCTAAGGTACAAGTACTCAAACCTCTATTATGTCGTCCGACATGGTCA 

AlaGlyProHisPheAsnProLeuSerArgLysHisGlyGlyProLysAspGluGluArg 
182 GCAGGTCCTCACTTTAATCCTCTATCCAGAAAACACGGTGGGCCAAAGGATGAAGAGAGG
CGTCCAGGAGTGAAATTAGGAGATAGGTCTTTTGTGCCACCCGGTTTCCTACTTCTCTCC 

HisValGlyAspLeuGlyAsnValThrAlaAspLysAspGlyValAlaAspValSerIle
242 CATGTTGGAGACTTGGGCAATGTGACTGCTGACAAAGATGGTGTGGCCGATGTGTCTATT
GTACAACCTCTGAACCCGTTACACTGACGACTGTTTCTACCACACCGGCTACACAGATAA 

GluAspSerValIleSerLeuSerGlyAspHisCysIleIleGlyArgThrLeuValVal
302 GAAGATTCTGTGATCTCACTCTCAGGAGACCATTGCATCATTGGCCGCACACTGGTGGTC 
CTTCTAAGACACTAGAGTGAGAGTCCTCTGGTAACGTAGTAACCGGCGTGTGACCACCAG 

HisGluLysAlaAspAspLeuGlyLysGlyGlyAsnGluGluSerThrLysThrGlyAsn
362 CATGAAAAAGCAGATGACTTGGGCAAAGGTGGAAATGAAGAAAGTACAAAGACAGGAAAC 
GTACTTTTTCGTCTACTGAACCCGTTTCCACCTTTACTTCTTTCATGTTTCTGTCCTTTG 

ENV 5B 

AlaGlySerArgLeuAlaCysGlyValIleGlyIleAlaMetAlaIleGluAlaGlnGln
422 GCTGGAAGTCGTTTGGCTTGTGGTGTAATTGGGATCGCCATGGCTATCGAAGCTCAACAA 
CGACCTTCAGCAAACCGAACACCACATTAACCCTAGCGGTACCGATAGCTTCGAGTTGTT 

HisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGlnAlaArgValLeuAlaVal
482 CACTTGCTGCAGTTGACCGTTTGGGGTATCAAGCAGTTGCAGGCTAGAGTTTTGGCTGTT 
GTGAACGACGTCAACTGGCAAACCCCATAGTTCGTCAACGTCCGATCTCAAAACCGACAA 

GluArgTyrLeuArgAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGlyLysLeuIle
542 GAAAGATACTTGAGAGATCAACAATTGTTGGGTATCTGGGGTTGTTCTGGTAAGTTGATT 
CTTTCTATGAACTCTCTAGTTGTTAACAACCCATAGACCCCAACAAGACCATTCAACTAA 

CysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluAspIleTrp 
602 TGTACCACCGCTGTTCCCTGGAACGCTTCTTGGTCTAACAAGTCTTTGGAAGACATCTGG 
ACATGGTGGCGACAAGGGACCTTGCGAAGAACCAGATTGTTCAGAAACCTTCTGTAGACC 

AspAsnMetThrTrpMetGlnTrpGluArgGluIleAspAsnTyrThrAsnThrIleTyr
662 GACAACATGACCTGGATGCAATGGGAAAGAGAAATCGACAACTACACCAACACCATCTAC 
CTGTTGTACTGGACCTACGTTACCCTTTCTCTTTAGCTGTTGATGTGGTTGTGGTAGATG 

ThrLeuLeuGluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeu 
722 ACCTTGTTGGAGGAATCTCAAAACCAACAAGAAAAGAACGAACAAGAATTGTTGGAATTG 
TGGAACAACCTCCTTAGAGTTTTGGTTGTTCTTTTCTTGCTTGTTCTTAACAACCTTAAC 

AspLysTrpAlaSerLeuTrpAsnTrpPheSerIleThrAsnTrpAM
782 GACAAGTGGGCAAGCTTGTGGAACTGGTTCTCTATCACCAACTGGTAG 

CTGTTCACCCGTTCGAACACCTTGACCAAGAGATAGTGGTTGACCATCAGCT 

Translated Mol. Weight = 30414.22 
SphI
;saif)



IC ATG CCT ATA GTG CAG AAT C 1
^^GGA TAT CAC GTC TTA C 




GGG CAT AAA GCA AGA GTT TTG TGA TAG I , 

CCC GTA TTT CGT TCT CAA AAC ACT ATC AGTT1 



synthetic 
oligonucleotides 



p25 gag (706 bp) 



synthetic 
oligonucleotides 



R 3mHT -iJ pBS100 cut
1 10
Met Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln
C ATG CCT ATA GTG CAG AAT CTG CAG GGG CAA ATG GTA CAT CAG

20
Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu
GCC ATA TCA CCT AGA ACT TTA AAT GCT TGG GTA AAA GTA GTA GAA

30 40
Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu
GAA AAG GCT TTC AGC CCA GAA GTA ATA CCC ATG TTT TCA GCA TTA

50
Ser Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr
TCA GAA GGA GCC ACC CCT CAA GAT TTA AAC ACC ATG CTA AAC ACA

60 70
Val Gly Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile
GTG GGG GGA CAT CAA GCA GCC ATG CAA ATG TTA AAA GAG ACT ATC

80
Asn Glu Glu Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala
AAT GAG GAG GCT GCC GAA TGG GAT AGA GTG CAT CCA GTG CAT GCA



90 100 
Gly Pro Ile Ala Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp
GGG CCT ATT GCA CCA GGC CAA ATG AGA GAA CCA AGG GGA AGT GAC 
110

Ile Ala Gly Thr Thr Ser Thr Leu Gln Glu Gln Ile Gly Trp Met
ATA GCA GGA ACT ACT AGT ACC CTT CAG GAA CAA ATA GGA TGG ATG 

120 130 

Thr Asn Asn Pro Pro Ile Pro Val Gly Glu Ile Tyr Lys Arg Trp

ACA AAT AAT CCA CCT ATC CCA GTA GGA GAA ATC TAT AAA AGA TGG 

140

Ile Ile Leu Gly Leu Asn Lys Ile Val Arg Met Tyr Ser Pro Thr
ATA ATC CTG GGA TTA AAT AAA ATA GTA AGA ATG TAT AGC CCT ACC 

150 160 

Ser Ile Leu Asp Ile Arg Gln Gly Pro Lys Glu Pro Phe Arg Asp

AGC ATT CTG GAC ATA AGA CAA GGA CCA AAG GAA CCC TTT AGA GAT 

170 

Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gln Ala Ser
TAT GTA GAC CGG TTC TAT AAA ACT CTA AGA GCC GAA CAA GCT TCA 

180 190 

Gln Asp Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gln Asn

CAG GAT GTA AAA AAT TGG ATG ACA GAA ACC TTG TTG GTC CAA AAT 

200 

Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala Leu Gly Pro Ala
GCA AAC CCA GAT TGT AAG ACT ATT TTA AAA GCA TTG GGA CCA GCA 

210 220 

Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val Gly Gly

GCT ACA CTA GAA GAA ATG ATG ACA GCA TGT CAG GGA GTG GGG GGA 

230 232 
Pro Gly His Lys Ala Arg Val Leu OP 
CCC GGG CAT AAA GCA AGA GTT TTG TGA TAG

Translated Mol. Weight = 25700.75 
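
A translated molecular weight of this kind is conventionally obtained by summing average residue masses and adding one water for the free termini. The following is a minimal sketch of that arithmetic, assuming commonly tabulated average residue masses; the mass table, the constant `WATER`, and the function name `molecular_weight` are not taken from this listing, and the sketch has not been checked to reproduce the printed value exactly, since the original program may have used slightly different masses.

```python
# Average residue masses in daltons (amino acid minus one water).
# These are commonly tabulated values, assumed here for illustration.
RESIDUE_MASS = {
    "Gly": 57.0519, "Ala": 71.0788, "Ser": 87.0782, "Pro": 97.1167,
    "Val": 99.1326, "Thr": 101.1051, "Cys": 103.1388, "Leu": 113.1594,
    "Ile": 113.1594, "Asn": 114.1038, "Asp": 115.0886, "Gln": 128.1307,
    "Lys": 128.1741, "Glu": 129.1155, "Met": 131.1926, "His": 137.1411,
    "Phe": 147.1766, "Arg": 156.1875, "Tyr": 163.1760, "Trp": 186.2132,
}
WATER = 18.0153  # added once for the N-terminal H and C-terminal OH


def molecular_weight(residues: list[str]) -> float:
    """Sum average residue masses for a chain and add one water."""
    return sum(RESIDUE_MASS[r] for r in residues) + WATER


# First translated line of the sequence above, split on whitespace.
first_line = "Met Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln".split()
print(round(molecular_weight(first_line), 2))
```

Applying the function to all 232 residues of the translation, excluding the stop label, gives a figure that can be compared with the printed weight.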
SOD— ►

MetAlaThrLysAla 
ATGGCTACAAAGGCT 
TACCGATGTTTCCGA 

ValCysValLeuLysGlyAspGlyProValGlnGlyIleIleAsnPheGluGlnLysGlu
1383 GTTTGTGTTTTGAAGGGTGACGGCCCAGTTCAAGGTATTATTAACTTCGAGCAGAAGGAA 
CAAACACAAAACTTCCCACTGCCGGGTCAAGTTCCATAATAATTGAAGCTCGTCTTCCTT 

SerAsnGlyProValLysValTrpGlySerIleLysGlyLeuThrGluGlyLeuHisGly
1443 AGTAATGGACCAGTGAAGGTGTGGGGAAGCATTAAAGGACTGACTGAAGGCCTGCATGGA
TCATTACCTGGTCACTTCCACACCCCTTCGTAATTTCCTGACTGACTTCCGGACGTACCT 

PheHisValHisGluPheGlyAspAsnThrAlaGlyCysThrSerAlaGlyProHisPhe 
1503 TTCCATGTTCATGAGTTTGGAGATAATACAGCAGGCTGTACCAGTGCAGGTCCTCACTTT 
AAGGTACAAGTACTCAAACCTCTATTATGTCGTCCGACATGGTCACGTCCAGGAGTGAAA 

AsnProLeuSerArgLysHisGlyGlyProLysAspGluGluArgHisValGlyAspLeu
1563 AATCCTCTATCCAGAAAACACGGTGGGCCAAAGGATGAAGAGAGGCATGTTGGAGACTTG
TTAGGAGATAGGTCTTTTGTGCCACCCGGTTTCCTACTTCTCTCCGTACAACCTCTGAAC

GlyAsnValThrAlaAspLysAspGlyValAlaAspValSerIleGluAspSerValIle
1623 GGCAATGTGACTGCTGACAAAGATGGTGTGGCCGATGTGTCTATTGAAGATTCTGTGATC
CCGTTACACTGACGACTGTTTCTACCACACCGGCTACACAGATAACTTCTAAGACACTAG

SerLeuSerGlyAspHisCysIleIleGlyArgThrLeuValValHisGluLysAlaAsp

1683 TCACTCTCAGGAGACCATTGCATCATTGGCCGCACACTGGTGGTCCATGAAAAAGCAGAT 
AGTGAGAGTCCTCTGGTAACGTAGTAACCGGCGTGTGACCACCAGGTACTTTTTCGTCTA 

AspLeuGlyLysGlyGlyAsnGluGluSerThrLysThrGlyAsnAlaGlySerArgLeu
1743 GACTTGGGCAAAGGTGGAAATGAAGAAAGTACAAAGACAGGAAACGCTGGAAGTCGTTTG 
CTGAACCCGTTTCCACCTTTACTTCTTTCATGTTTCTGTCCTTTGCGACCTTCAGCAAAC 

linker p31 
AlaCysGlyValIleGlyIleAlaGlnAsnSerGlyValGlyAlaMetAlaMetAlaSer
1803 GCTTGTGGTGTAATTGGGATCGCCCAG AATTCAGGTGTTGGAGCCATGGC CATGGCTAGT
CGAACACCACATTAACCCTAGCGGGTCTTAA GTCCACAACCTCGGTACCGGTA C 

AspPheAsnLeuProProValValAlaLysGluIleValAlaSerCysAspLysCysGln
1863 GATTTTAACCTGCCACCTGTAGTAGCAAAAGAAATAGTAGCCAGCTGTGATAAATGTCAG
CTAAAATTGGACGGTGGACATCATCGTTTTCTTTATCATCGGTCGACACTATTTACAGTC 

LeuLysGlyGluAlaMetHisGlyGlnValAspCysSerProGlylleTrpGlnLeuAsp 
1923 CTAAAAGGAGAAGCCATGCATGGACAAGTAGACTGTAGTCCAGGAATATGGCAACTAGAT 
GATTTTCCTCTTCGGTACGTACCTGTTCATCTGACATCAGGTCCTTATACCGTTGATCTA 
CysThrHisLeuGluGlyLysIleIleLeuValAlaValHisValAlaSerGlyTyrIle
1983 TGTACACATCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTAGCCAGTGGATATATA 
ACATGTGTAGATCTTCCTTTTTAATAGGACCATCGTCAAGTACATCGGTCACCTATATAT 

GluAlaGluValIleProAlaGluThrGlyGlnGluThrAlaTyrPheLeuLeuLysLeu
2043 GAAGCAGAAGTTATTCCAGCAGAGACAGGGCAGGAAACAGCATATTTTCTCTTAAAATTA 
CTTCGTCTTCAATAAGGTCGTCTCTGTCCCGTCCTTTGTCGTATAAAAGAGAATTTTAAT 

AlaGlyArgTrpProValLysThrIleHisThrAspAsnGlySerAsnPheThrSerThr
2103 GCAGGAAGATGGCCAGTAAAAACAATACATACAGACAATGGCAGCAATTTCACCAGTACT
CGTCCTTCTACCGGTCATTTTTGTTATGTATGTCTGTTACCGTCGTTAAAGTGGTCATGA

ThrValLysAlaAlaCysTrpTrpAlaGlyIleLysGlnGluPheGlyIleProTyrAsn
2163 ACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCAGGAATTTGGCATTCCCTACAAT 
TGCCAATTCCGGCGGACAACCACCCGTCCCTAGTTCGTCCTTAAACCGTAAGGGATGTTA 

ProGlnSerGlnGlyValValGluSerMetAsnAsnGluLeuLysLysIleIleGlyGln
2223 CCCCAAAGTCAAGGAGTAGTAGAATCTATGAATAATGAATTAAAGAAAATTATAGGACAG 
GGGGTTTCAGTTCCTCATCATCTTAGATACTTATTACTTAATTTCTTTTAATATCCTGTC 

ValArgAspGlnAlaGluHisLeuLysThrAlaValGlnMetAlaValPheIleHisAsn
2283 GTAAGAGATCAGGCTGAACACCTTAAGACAGCAGTACAAATGGCAGTATTCATCCACAAT 
CATTCTCTAGTCCGACTTGTGGAATTCTGTCGTCATGTTTACCGTCATAAGTAGGTGTTA 

PheLysArgLysGlyGlyIleGlyGlyTyrSerAlaGlyGluArgIleValAspIleIle
2343 TTTAAAAGAAAAGGGGGGATTGGGGGATACAGTGCAGGGGAAAGAATAGTAGACATAATA 
AAATTTTCTTTTCCCCCCTAACCCCCTATGTCACGTCCCCTTTCTTATCATCTGTATTAT 

AlaThrAspIleGlnThrLysGluLeuGlnLysGlnIleThrLysIleGlnAsnPheArg
2403 GCAACAGACATACAAACTAAAGAACTACAAAAGCAAATTACAAAAATTCAAAATTTTCGG
CGTTGTCTGTATGTTTGATTTCTTGATGTTTTCGTTTAATGTTTTTAAGTTTTAAAAGCC

ValTyrTyrArgAspAsnLysAspProLeuTrpLysGlyProAlaLysLeuLeuTrpLys 
2463 GTTTATTACAGGGACAACAAAGATCCCCTTTGGAAAGGACCAGCAAAGCTTCTCTGGAAA 
CAAATAATGTCCCTGTTGTTTCTAGGGGAAACCTTTCCTGGTCGTTTCGAAGAGACCTTT 

GlyGluGlyAlaValValIleGlnAspAsnSerAspIleLysValValProArgArgLys
2523 GGTGAAGGGGCAGTAGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAGAAAA
CCACTTCCCCGTCATCATTATGTTCTATTATCACTGTATTTTCATCACGGTTCTTCTTTT 

AlaLysIleIleArgAspTyrGlyLysGlnMetAlaGlyAspAspCysValAlaSerArg
2583 GCAAAAATCATTAGGGATTATGGAAAACAGATGGCAGGTGATGATTGTGTGGCAAGTAGA
CGTTTTTAGTAATCCCTAATACCTTTTGTCTACCGTCCACTACTAACACACCGTTCATCT 

GlnAspGluAspAM 
2643 CAGGATGAGGATTAG
GTCCTACTCCTAATC 
Sequence of SOD/env-4
SOD 

MetAlaThrLysAlaValCysValLeuLysGlyAspGlyProValGlnGlyIleIleAsn
1 CATGGCGACGAAGGCCGTGTGCGTGCTGAAGGGCGACGGCCCAGTGCAGGGCATCATCAAT 
CGCTGCTTCCGGCACACGCACGACTTCCCGCTGCCGGGTCACGTCCCGTAGTAGTTA 

PheGluGlnLysGluSerAsnGlyProValLysValTrpGlySerIleLysGlyLeuThr
62 TTCGAGCAGAAGGAAAGTAATGGACCAGTGAAGGTGTGGGGAAGCATTAAAGGACTGACT 
AAGCTCGTCTTCCTTTCATTACCTGGTCACTTCCACACCCCTTCGTAATTTCCTGACTGA 

GluGlyLeuHisGlyPheHisValHisGluPheGlyAspAsnThrAlaGlyCysThrSer 
122 GAAGGCCTGCATGGATTCCATGTTCATGAGTTTGGAGATAATACAGCAGGCTGTACCAGT 
CTTCCGGACGTACCTAAGGTACAAGTACTCAAACCTCTATTATGTCGTCCGACATGGTCA 

AlaGlyProHisPheAsnProLeuSerArgLysHisGlyGlyProLysAspGluGluArg 
182 GCAGGTCCTCACTTTAATCCTCTATCCAGAAAACACGGTGGGCCAAAGGATGAAGAGAGG 
CGTCCAGGAGTGAAATTAGGAGATAGGTCTTTTGTGCCACCCGGTTTCCTACTTCTCTCC 

HisValGlyAspLeuGlyAsnValThrAlaAspLysAspGlyValAlaAspValSerIle
242 CATGTTGGAGACTTGGGCAATGTGACTGCTGACAAAGATGGTGTGGCCGATGTGTCTATT 
GTACAACCTCTGAACCCGTTACACTGACGACTGTTTCTACCACACCGGCTACACAGATAA 

GluAspSerValIleSerLeuSerGlyAspHisCysIleIleGlyArgThrLeuValVal
302 GAAGATTCTGTGATCTCACTCTCAGGAGACCATTGCATCATTGGCCGCACACTGGTGGTC 
CTTCTAAGACACTAGAGTGAGAGTCCTCTGGTAACGTAGTAACCGGCGTGTGACCACCAG 

HisGluLysAlaAspAspLeuGlyLysGlyGlyAsnGluGluSerThrLysThrGlyAsn
362 CATGAAAAAGCAGATGACTTGGGCAAAGGTGGAAATGAAGAAAGTACAAAGACAGGAAAC 
GTACTTTTTCGTCTACTGAACCCGTTTCCACCTTTACTTCTTTCATGTTTCTGTCCTTTG 

Env4--*- 

AlaGlySerArgLeuAlaCysGlyValIleGlyIleAlaMetGluValValIleArgSer
422 GCTGGAAGTCGTTTGGCTTGTGGTGTAATTGGGATCGCCATGGAGGTAGTAATTAGATCT 
CGACCTTCAGCAAACCGAACACCACATTAACCCTAGCGGTACCTCCATCATTAATCTAGA

AspAsnPheThrAsnAsnAlaLysThrIleIleValGlnLeuAsnGluSerValAlaIle
482 GACAATTTCACGAACAATGCTAAAACCATAATAGTACAGCTGAATGAATCTGTAGCAATT 
CTGTTAAAGTGCTTGTTACGATTTTGGTATTATCATGTCGACTTACTTAGACATCGTTAA 

AsnCysThrArgProAsnAsnAsnThrArgLysSerIleTyrIleGlyProGlyArgAla
542 AACTGTACAAGACCCAACAACAATACAAGAAAAAGTATCTATATAGGACCAGGGAGAGCA 
TTGACATGTTCTGGGTTGTTGTTATGTTCTTTTTCATAGATATATCCTGGTCCCTCTCGT 
PheHisThrThrGlyArgIleIleGlyAspIleArgLysAlaHisCysAsnIleSerArg
602 TTTCATACAACAGGAAGAATAATAGGAGATATAAGAAAAGCACATTGTAACATTAGTAGA 
AAAGTATGTTGTCCTTCTTATTATCCTCTATATTCTTTTCGTGTAACATTGTAATCATCT 

AlaGlnTrpAsnAsnThrLeuGluGlnIleValLysLysLeuArgGluGlnPheGlyAsn
662 GCACAATGGAATAACACTTTAGAACAGATAGTTAAAAAATTAAGAGAACAGTTTGGGAAT 
CGTGTTACCTTATTGTGAAATCTTGTCTATCAATTTTTTAATTCTCTTGTCAAACCCTTA 

AsnLysThrIleValPheAsnGlnSerSerGlyGlyAspProGluIleValMetHisSer
722 AATAAAACAATAGTCTTTAATCAATCCTCAGGAGGGGACCCAGAAATTGTAATGCACAGT 
TTATTTTGTTATCAGAAATTAGTTAGGAGTCCTCCCCTGGGTCTTTAACATTACGTGTCA 

PheAsnCysArgGlyGluPhePheTyrCysAsnThrThrGlnLeuPheAsnAsnThrTrp 
782 TTTAATTGTAGAGGGGAATTTTTCTACTGTAATACAACACAACTGTTTAATAATACATGG 
AAATTAACATCTCCCCTTAAAAAGATGACATTATGTTGTGTTGACAAATTATTATGTACC 

ArgLeuAsnHisThrGluGlyThrLysGlyAsnAspThrIleIleLeuProCysArgIle
842 AGGTTAAATCACACTGAAGGAACTAAAGGAAATGACACAATCATACTCCCATGTAGAATA
TCCAATTTAGTGTGACTTCCTTGATTTCCTTTACTGTGTTAGTATGAGGGTACATCTTAT

LysGlnIleIleAsnMetTrpGlnGluValGlyLysAlaMetTyrAlaProProIleGly
902 AAACAAATTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATTGGA 
TTTGTTTAATATTTGTACACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAACCT 

GlyGlnIleSerCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyThr
962 GGACAAATTAGTTGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTACA 
CCTGTTTAATCAACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATGT 

AsnValThrAsnAspThrGluValPheArgProGlyGlyGlyAspMetArgAspAsnTrp 
1022 AATGTAACTAATGACACCGAGGTCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGG 
TTACATTGATTACTGTGGCTCCAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACC 

ArgSerGluLeuTyrLysTyrLysValIleLysIleGluProLeuGlyIleAlaProThr
1082 AGAAGTGAATTATATAAATATAAAGTAATAAAAATTGAACCATTAGGAATAGCACCCACC
TCTTCACTTAATATATTTATATTTCATTATTTTTAACTTGGTAATCCTTATCGTGGGTGG

LysAlaLysArgArgValValGlnArgGluLysArgOP OP 
1142 AAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGATGATGAAGCTTG

TTCCGTTTCTCTTCTCACCACGTCTCTCTTTTTTCTACTACTTCGAACAGCT 